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232451 1 


BEQ91G89 BE091123 AA319959 
AW5D3329 N46610 AA331571 






AW753967 AA370795 AA33163Q AW962550 
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4W?fl1 


247933^1 


AA352 1 1 "T AWSB2247 AA429695 
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AA382B14 AA402411 AA412355 
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AA470519 BE303010 BE302954 BE38412Q 
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4Tlfi7fi 
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470321 1 


BE164500 AA832198 BE1645Q2 


444910 


624951~1 


AI201849 3EQS9Q07 AW946544 




63943 1 


AV653771 BED89370 


446922 




BE1756Q5 Z43529 F06810BE175602AV661027 






R36075 AI366546 R36167 


448420 


76273 T 


BE6230D4 AA380669 BE263627 BE246433 




766241 1 


AW898595 AW898588 AW89859Q AW898663 AW898592 A1525093 




837264 1 


Af698839 AI9D9260 A1909259 


450736 


844652 1 


AW970060 AI7323S6 AI792313 AW839644 


4^1f)R7 




AA442176 AA259181 






BE172186 AA059279 AA020815 AA013437 




86640 1 


AW936273 AW340350 AA017208 


452542 




AMffUVJSR AlA/S'199^7 A}QHFiA'?1 Atonfi<f99 
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AL037925 AL037931 AL037957 
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AWB55717 AW362452 AW362443 






N71277 AW390764 


454545 


1223779„1 


AWB06899 AW866451 AW866393 AW866297 AWB17869 




12291 32 1 


AWB1342B AW813444 AW813367 AWB133B8AWB13429 AWB13424 




1230493 1 


AW815G98 BE154B43 BE154831 




1250536 1 


AW852530 AW852527 AW352526 




1252971_1 


BE1 52423 AW855572 AW6556Q7 


455097 
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AW855802 AWB55794 AW355797 AW855806 AW355796 AW855808 AW355793 AW855807 
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BE 1 60198 AW935893 T1 1 520 AW935930 AW858073 AW8S1034 




1 2B9854_"i 
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BE144762 AW979D91 
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1348732J 


BE0649B2 BE064979 BE064853 B£064B57 BE064856 BE064977 BE064930 BE054860 BE0B4815 BED64957 BEOB4804 






BE064816 BE06485C BE064805 BE064796 BE064B1B BE064975 BE0B4819 BE06'4810 BE064668 BE065059 


4556S5 


1350393 1 


BE066976 BE066928 BE066927 


455700 


1351264J 


BE0B8115 BE068104 BE068t02 BE068096 BEQS81G3 BE068154 BE068198 


45570B 


1352232J 


BE06932GBE069290BEOB9352 


455732 


1353874 1 


BE08090& BE07225B BE072190 BE072236 


455636 


1374805J 


BE14580BBE145807BE1818B3 


455935 


1384144 1 


BE15S6B7 BE15SG88 


455945 


1385588J 


BE160636 BE160606 BE160703 


4S6207 
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456482 
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AA485224 M287306AA258121 
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4731 U 
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458673 
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TABLE 19B 



Table 19B shows the genomic positioning for those pkeys lacking unigene ID's and 
accession numbers in Table 19. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers In this column are Genbank Identifier (Gl) numbers. "Dunham I. et al.° refers to the publication 

entitled 'The DNA sequence of human chromosome 22." Dunham I. et al„ Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 

NLposition: Indicates nucleotide positions of predicted exons. 



Pkey Ref 


Strand 


NLposition 


400555 9801191 


Minus 


134694-134817 


400608 9B87666 


Minus 


96756-97558 


400610 9887671 


Minus 


117606-117928,124040-124147 


400925 7651921 




381 83-38391 ,43900-44086 


401045 8117619 


Plus 


90044-90184,91111-91345 


401049 7232177 


Plus 


149157-150692 


401093 8516137 




22335-23166 


401256 9796573 


Minus 


45482-45620 


401283 9800093 




47256-47456 


401326 9212516 


Minus 


226246-227505 


401418 7452889 


Minus 


124865-125075 


401451 6634068 


Minus 


119926-121272 


401458 9187886 


Plus 


76485-77597 


401497 7381770 


Plus 


92607-92813 


401508 7534110 


Minus 


110779-110983 


401575 7229804 


Minus 


76253-78364 


401747 9789672 


Minus 


118596-118816,119119-119244,119609-119761,120422-120990,130161-130381,130468-130553,1310'!?.. 






131256,131866-131932,132451-132575,133580-134011 


401781 7249190 


Minus 


83215-83435,83531-83656,63740-83901,84237-84393,84955-85037,86290-86814 


401785 7249190 


Minus 


165776-165996.166189-166314 t 15640B-166569,167112-167268,167387-167469,168634-168942 


401793 7263888 


Minus 


102945-103083 


401987 4406829 


Minus 


72893-73021 ,7693B-77049 


402077 8117414 


Plus 


65014-65195 


402109 8131678 


Minus 


171722-171859,173197-173303 


402184 8576001 




112844-112986,113505-113636 


402376 9625329 


Minus 


21753-22385 


402421 9796341 


Minus 


466094666'2,4675B46811,86293-86346,B9776-89829 [ 90048-90101,1O2817-102924 


402578 9884928 


Plus 


66350-66496 


402606 9909429 


Minus 


81747-82094 


402696 7328818 


Minus 


23600-23731 


402765 9367757 


Plus 


109588-109726 


402800 6010175 


Plus 


43921-44049,46181-46273 


402820 6456853 


Minus 


82274-82443 


4D2892 8086B44 


Minus 


194384-194645 


403133 7331427 


Plus 


38314-38634 


403356 8569930 


Plus 


92839-93036 


403388 9438331 


Plus 


112733-113001,114599-114735 


403426 9719529 


Minus 


157156-158183 


403585 8101208 


Minus 


131266-131769 


403593 6862650 


Minus 


62554-62712,69449-69602 


403637 8671936 


Miiius 


142647-142771,145531-145762 


403639 8671948 


Plus 


113234-113326,115186-115287,119649-119786 


403677 7331517 


Minus 


55008-55083,62860-63051 


403775 7770580 


Minus 


102247-102326,103095-103148 


403943 7711864 


Plus 


100742-100904,101322-101503 
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404091 7684554 


Minus 


82121-83229 


404097 7770701 


m'| US 


55512-55781 


404142 9856692 




80316-80459 


404253 9367202 


Minus 


55675-56055 


404274 9885189 


Pus 


104127-104318 


404285 2326514 




32282-32416 


404360 985B45Q 


Minus 


122873-122966,151324-151469,153093-153253 


404440 7528051 




80430-81581 


404443 7579073 


Minus 


87198-87441 


404552 7243881 


Plus 


19354-2C01O 


404561 9795980 


Minus 


69039-701QO 


404580 6539738 


Minus 


240588-241589 


404721 9856648 


Minus 


173763-174294 


404826 6572184 




47726-43046 


404983 4432779 


Minus 


51178-51374,52000-52173 


405037 7543748 


Minus 


127374-127578 








405095 8072599 




138877-139066 


405153 9965565 


Minus 


175317-175500 


405196 7230083 


Minus 


135716-135851 ■ 


405232 7249042 


Plus 


125904-126063 


405248 7259728 
405336 5094635 


Pus 


33267-33563 


405394 6624123 


Minus 


31900-32373 


405460 7684589 


Minus 


52223-52389 


405494 8050952 


Minus 


70284-70518 


405547 1054740 




124361-124520,124914-125050 


405609 5757553 


Minus 


42814-43010,43583-43783,4486345033,4642946554,47815-4801 8,49961 -50 1 53,51624-5 1727,51823-51 959,52702- 
52918,55469-55601,57111-57307,58169-58296,60215-60332,61482-61727 


40563B 6289229 


Plus 


1 99260-199372,199826-1 99929 


405654 4895155 


Minus 


53624-53759 


405718 9795467 


Plus 


113080-113266 


405822. 6273498 


Minus 


154660-154974.155203-155379 


405848 7651809 


Minus 


28135-28244 


405873 6758747 


Minus 


32129-32764 


405906 7705124 


Minus 


10835-11059 


405917 7712162 


Minus 


106829-107213 


405925 6758795 


Plus 


129935-130282 


405953 7960374 


Minus 


65101-65574 


406D69 9117732 


Plus 


68880-69374 


406151 7144806 
406153 9929734 


Minus 
Minus 


94087-94285 
12902-13069 


405162 5923650 


Minus 


28256-28935 


406271 7534217 
406291 5685274 
406348 9255985 


Plus 
Plus 
Minus 


36179-36692 

9562-9867 

71754-71944 


406414 9256407 


Plus 


4959349850 


406446 9454509 


Minus 


1 16424-1 1 6527,1 1 8721 -1 1 8B59, 121 187-1 21364 


406504 7711360 


Minus 


107068-107277 


406554 7711566 


Plus 


106956-107121 
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TABLE 20: 544 GENES UP-REGULATED IN BREAST CANCER 
COMPARED TO NORMAL ADULT TISSUES THAT ARE 
LIKELY TO ENCODE EXTRACELLULAR OR CELL- 
5 SURFACE PROTEINS 

Table 20 shows 544 genes up-regulated in breast cancer compared to normal adult tissues 
that are likely to encode extracellular or cell-surface proteins. These were selected as for 
Table 19, except that the ratio was greater than or equal to 3.0, the "average" normal adult 
10 tissue level was set to the 85 th percentile value amongst 144 non-malignant tissues, and the 
96 th percentile value amongst the 73 breast cancer specimens was greater than or equal 100 
units, and the predicted protein contained a structural domain that is indicative of 
extracellular localization (e.g. ig, fh3, egf, 7tm domains, signal sequences, transmembrane 
domains). The predicted protein domains are noted, 

15 

Pkey: Unique Eos probeset Identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnlgenelD: Unigene number 
20 Pred.Prot.Domains: Predicted Protein Domains 
Unigene Title; Unigene gene title 

R1 : Ratio of 93 rd percentile of turner to 85* percentile of normal body tissue 



25 Pkey ExAccn UnlgenelD Pred,Praf.Domains 



408591 AF015224 Hs.46452 

400291 M401369 Hs.19Q721 
449746 A166B594 Hs,176588 
4D7277 AW170035 Hs.326736 

400292 AA250737 Hs.72472 
424735 03W5 Hs.272499 



407178 
4OB000 
427585 
429441 
450375 
420331 
422109 
445730 
451110 



452744 
424634 
420757 
424399 
447350 



435496 
402578 
453160 



NM_000230Hs.19423B 

M195651 Hs.104106 

L11690 Hs.620 

D31152 Hs.179729 

AJ224172 Hs.204096 

AA009647 Hs.8850 

AF044197 Hs.100431 

S73265 Hs.1473 

AI624342 Hs.170042 

AI955040 Hs.265398 

AI127076 Hs.334473 

X51501 Hs.99949 

AI267652 Ha.30504 
NMJ03613HS.151407 

X78592 Hs.99915 

AI905687 Hs.2533 

AI375572 Hs.172634 
AA193450 

AL137517 Hs.334473 

AA399272 Hs.144341 

X52509 Hs.161640 

AW840171 Hs.265398 

AI263307 Hs.239864 

AL120862 Hs,124165 
NM_014398Hs.10S87 



SS,Uterog!cbin J SS,Uteroglobin 



TM 

SSXeptin.SS.Leptin, 
,SS,Dihydroorolase, 

Plectln_repeat,SH3,spectrin,SS,Plectln_r 
SS,C1q,Collagen,SS,C1q,Collagen, 
"i.Uteroglcbin, 



ESTs, Weakly similar to CP4Y_HUMAN CYTOC 
Homo sapiens breast cancer antigen NY-BR 
BMP-R1B 

short-chain alcohol dehydrogenase family 
gb:QV3-BT0381-27Q100-073-c08 BT0381 Homo 
leptin (murine obesity homolog) 
ESTs 

bullous pemphigoid antigen 1 (230/24OkD) 
collagen, type X, alpha 1 (Schmid metaph 
lipophilin B (uteroglobin family member) 



,SS,TM,Calian_efflux 



,SS r TM t GNSLSUR4,cNMP_blnding,Rlla 

ig.tspj.ss.AM 

hormone_recAndrogen_recep,zf-C4, 



,SS,TM,disintegrin,Pep_M12B_propep,Repro a disintegrin and mi 

' small inducible cytokine B subfamily (Cy 

gasirin-releaslng peptide 
ESTs 

ESTs, Weakiy similar to transfoimatlon-r 
hypothetical protein CKFZp56401278 
prolactin-induced protein 
Homo sapiens mRNA; cDNA DKFZp434E062 (fr 
cartilage intermediate layer protein, nu 
androgen receptor (dihydrotestosterone r 
aldehyde dehydrogenase 9 family, member 
ESTs 

gb:zr40e07.ri Soares_NhHMPu„S1 Homosapl 
I in hypothetical protein DKFZp56401278 

SS ESTs 
,SS,TM,amino!raiLl„2,Cadherin_C„term 1 cad tyrosine aminotransferase 
SS ESTs, Weakly similar to transformations 

SS,p450,SS,TM,p450 C1001134:gl|21l7372|pir||l659B1 fatly ac 

SS H2B histone family, member L 

SS programmed cell death 9 (PDCD9) 

Lam^SS.TMXamp, similar to lysosome-associated membrane 
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449765 N92293 Hs.20683Z 
428227 AA321649 Hs.2248 
425892 D9D041 Hs.155956 
424001 W67BB3 Hs.1 37476 
448595 ABQ14544 Hs.21572 
449448 DB0730 Hs.57471 
418007 M13509 Hs.83169 
418994 AA29652Q Hs.B9546 
453596 AA441S38 Hs.S2905 
452401 NM„007115Hs.29352 
446591 H44186 Hs. 15456 
419296 AA236115 Hs.120785 
452838 U65011 Hs.30743 
422B05 AA4369B9 Hs.121017 
448390 AL035414 Hs.21068 
447342 AI199268 Hs.19322 
411869 W20027 Hs.23439 
443709 AI082692 Hs.134662 
459587 AA031956 
442580 AI733682 Hs.1 30239 
400289 X07820 Hs.2258 
411598 BE336654 Hs.70937 
415263 AA948033 Hs.130853 
433805 AA706910 Hs.1 12742 
407276 A1951118 Hs.326736 
443348 AW873596 Hs.182278 
421037 Ai684808 Hs.197653 
424086 AI351010 Hs,102267 
400295 W72838 Hs.2533 
452461 N78223 Hs.108106 
427365 AI873274 Hs.190721 
433365 AF026944 Hs.293797 
409269 .AA576953 Hs.22972 
432596 AJ224741 Hs.278461 
408771 AW732573 Hs.475B4 
432912 BE007371 Hs.200313 
447033 Ai357412 Hs. 157601 
421155 H67879 Hs.102257 
424905 NM_0O2497Hs.1537O4 
425398 AL049S89 Hs.156369 
438167 R28363 Hs.242BS 
459583 A1907673 
423945 AA410943 
439B20 AL360204 Hs.283853 
4O2606 

445263 H57646 Hs.42586 
430217 N47863 Hs.336901 
447164 AF026941 Hs.1751S 
431385 BE178536 Hs.11090 
423887 AL080207 Hs.134585 
415385 R17798 Hs.7535 
425704 U79293 Hs.159264 
429859 NKJQ7050HS.225952 
425523 AB007948 Hs.158244 
428368 BE440042 Hs.83326 
418912 NMJ0O885HS.89472 
42Z026 U80736 Hs.110826 
451952 AL120173 Hs.301663 
438199 AW016531 Hs.122147 
400608 

413472 8E242870 Hs.75379 
432374 W68815 Hs.301B85 
402408 NA 

445537 AJ245671 Hs.12844 
451621 AI879148 Hs.26770 
405654 NA 

434988 AI418055 Hs.161160 



SS ESTs, Moderately similar to ALUBJHUMAN A 

SSJL8, small Inducible cytokine subfamily B (CX 

,SS,Ac3tyl1raji5f2, N-aoetyltransferase 1 (arylamine N-acety 

,pkinase, paternally expressed 10 

LRRCT,LRR,SS,LRRCT,serine_carbpept K1AA0B44 gene product 

SS ESTs 

SS,hemopexh,Peptidase_M10,SS,Peptidase_malrlx metalloproteinase 1 (MMP1; inters 

SS,!ecfin„c,sushl,EGF,SS,EGF,lectin_c,su selectin E {endothelial adhesion moiecul 

SS hypothetical protein FL) 14834 

,SS,CUB,xlhk, tumor necrosis factor, alpha-induced pro 

PDZ,SS PDZ domain containing 1 

SS ESTs 

SS.SS preferentially expressed antigen in mela 

hlstone,SS,histone,hlstone H2A hlstone family, member A 

SS hypothetical protein 

.SSjipocalin Homo sapiens, Similar to RIKEN cDN A 201 0 

,SS.Peptidase_M1, ESTs 

,SS,TM,SNF ESTs 

,SS,LIM, gb:zM5e04.s1 Soares_pregnant_uterus_NbH 

SS ESTs 

hemopexin,PepUdase_M10,SS,Peptldase_M10matrix metalloproteinase 10 (MMP10; str 

histone,SS,hisfone,histone H3 histone family, member A 

.SS.histone.nistoneJinkerJiistone ESTs 

,SS,RibosomalJ.7Ae, ESTs 

TM Homo sapiens breast cancer antigen NY-BR 

,SS,DENN calmodulin 2 (phosphorylase kinase, delt 

SS programmed cell death 9 (PDCD9) 

,SS,Lysyl_oxidase iysyi oxidase 

aldehyde dehydrogenase 9 family, member 



iS,G9a,PHD. 



ESTs 
ESTs 

hypothetical protein FU 13352 
matrilin 3 

potassium voltage-gated channel, delayed 
ESTs 
ESTs 

SS,LysyLoxidase,Aldose_epim,Epimerase,S lysyl oxidase 
pklnas^SS.T.Hpkinase.polyprenyLsynf, NIMA (never In mitosis gene a)-re!ated k 



SS,TM,UPF001 6,SS,TM,U PF001 6 
SS,EGF,w/s,SS,TM,vwa, 
TMXtetra,ion_trans,SS r TM,l^te1ra,i 
,SS,TM,Folate_carrier 



hypothetical protein similar to tenascln 
ESTs 

gb:ll-BT152-OB0399-004 BT152 Homo saplen 
gb:zt32h03.r1 Scares ovary tumor NbHOT H 
Homo sapiens mRNA full length insert cDN 
NMj)24S26:Homo sapiens hypofhetlcal prot 
KIAA1560 protein 
l RNA_poLA,RNA_pbLA2,RibosQmaLS24e,ribosoma] protein S24 



,SS,TM,7tm_1,p450,rrm 
,pkinase, 

death,ZU5,TM,Activln_recp,pkinase, 



,TM,lBR 
,SS,TM 

,SS,TM,BRCT,ank,ABC_tjan,ABCJran 
,SS.Fork_head, 

,SS,TM,Y_phosphatase,MAM,fn3, 



Homo sapiens cig5 mRNA, partial sequence 
membrane-spanning 4-domains, subfamily A 
DKFZP434G232 protein 
COBW-like protein 

Human dona 2394B m.RNA sequence 
protein tyrosine phosphatase, recBptort 



3S,laminin_B,lamlrrirLEGr,lamin:ri_Nterm KIAA0479 protein 
;s,Peptidase_M10,bernopexin,SS,Pepridase_matrix metalloproteinase 3 (stromelysin 



SS,TM,7troJ f SS,TM,7trn_1, 



angiotensin receptor 1 
trinucleotide repeat containing 9 
ESTs 
ESTs 

C10001899:gi|750B633[pirl|T25392 hypothe 
solute carrier family 1 (glial high affr 
Homo sapiens cDNA FU11346 fis, clone PL 
NtvlJttOgzo^Homc sapiens hypothetical pro 
EGF-like-domain, multiple 6 (EGFL6) 
fatty ac : d bii jino protein 7, brain 
C12001521:gi|7513934|pir)lT31081 cca3 pr 
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416220 N49776 Hs,170994 ,SSJM hypothetical protein MGC10946 9.5 

431608 M30703 Hs.270833 SS,TM,EGF,SS amphiregulin (schwannoma-derived growth 9.5 

414142 AW368397 Hs.150042 ,SS,UDPGT Homo sapiens cDNA FLJ14438 (is, clone HE 9.4 

400298 AA032279 Hs.61635 TM six transmembrane epithelial antigen of 9.4 

5 418601 AA279490 Hs.86368 SS,TM ; calrellculm,SS,TM,calreticulin, oalmegin 9.4 

415539 AI733BB1 Hs.72472 death.ZUSJM.Aciivinjecp.pWnase, BMP-R1B 9.4 

421451 AA291377 Hs.50831 TM ESTs 9.3 

429432 AI67B059 Hs.202676 SS synaptonemal complex protein 2 9.3 

442441 AI820BB2 Hs.129598 SS ESTs 9.1 

10 426429 X73114 Hs.169B49 ,SS,TM,fn3,lg, myosin-binding protein C, slow-type 9.1 

40B687 M31126 Hs.272620 SS,Peptidase_M10,hem D pexin,SS,Peptidase_matrixmetal]oproteinase 11 (MMP11;stro 9.1 

448693 AW004854 Hs.228320 SS hypothetical protein FLJ23537 9.1 

419948 AB041035 Hs.93847 Fe]tlc_rediict ( TM,Ferriejeduct, NM_015931:Homo sapiens NADPH oxidase 4 ( 9.1 

426214 H59346 Hs.128355 SS ESTs, Moderately similar to ALU7_HUMAN A 9.0 
15 427718 AI798680 Hs,25933 ,SS,TM,histone,Sec1,histone J sugar_tr ESTs 8.9 

414812 X72755 Hs.77367 SS,IL8,SS,IL8 monokine induced by gamma interferon 8.8 

400235 NA ,TM,ABC_trati^BC_membrahe, Eos Control ' 8.8 

422330 D307E3 Hs.115263 SS,TM,EGF,SS,TM eptreguliri 8.8 

416162 NM_004354Hs.79069 cyclln.SS cyciin G2 8.B 

20 420077 AW512260 Hs.87767 SS ESTs 8.7 

452281 T93500 Hs.28792 ,SS,TGF-beta,TGFb_propeptide, Homo sapiens cDNA FU11041 fis, clone PL 8.7 

434531 AA642007 Hs.116369 SS ESTs 8.6 

408330 AF123050 Hs.44532 ,SS,TM,ublqLii0^7ta_3,ANF_receptor,sush diubiqultin 8.6 

443788 AI732643 Hs.144151 TM ESTs 8 6 

25 411078 AI222020 Hs.182364 SS,SS CocoaCrisp 8.5 

445495 BE622641 Hs.36489 SS,SS,ENTH,l_LWEQ,ENTH,l_LWEQ,DNA^.mis_reESTs, Weakiy similar to '38022 hypoihefl 8.5 

433426 H69125 Hs.133525 ,SS,TM ESTs 8,5 

424671 NMJ04525Hs,153595 SS,EGFJdljKepLa,idU-ecepLb,SS,TM,E low density lipoproteln-related protein 8.4 

426215 AW963419 Hs.155223 SS stannlocalcin 2 8.4 
3 0 409045 AA635062 Hs.50094 TM Homo sapiens mRNA; cDNA DKFZp434O0515 (f 8.4 

435525 AI831297 Hs.123310 TM ESTs 8.3 

409203 AA780473 Hs,687 SS,p450,SS,p450 cytochrome P450, subfamily IVB, polypept 8.3 

424902 NM„003866Hs.1 53687 SS,SS inositol poiyphosphale-4-phosphatase, ty 8.3 

431725 X65724 Hs.2839 SS ( Cys_knoi,SS Nome disease (pseudoglioma) 8.3 

35 418092 R45154 .Hs.106604 ,death,ZU5,pkinase^cbvin_recp, ' ESTs 8.3 

439840 AW449211 Hs.105445 SS GDNF family receptor alpha 1 8.2 

427811 M81057 Hs.180BB4 SS,Zn_carbOpept ( Propep_M14,SS J Propep„M14carboxypepticlaseB1 (tissue) 8.2 

420807 AA280627 Hs.57846 SS,cpn10 ESTs 8 2 

426320 W47595 Hs.169300 SS.TGF-betaJGFb.propeptide.SS transforming growth factor, beta 2 8.2 

40 447078 AW385727 Hs.301570 ,SS,kazal, ESTs 8.1 

415786 AW41919S Hs.257924 SS hypothetical protein FU137B2 8.1 

410102 AW248508 Hs.279727 SS Homo sapiens cDNA FLJ14035fis, clone HE 8,0 

404347 SS Target Exon 8.0 

433687 AA743991 TM gb:ny57g01.s1 NCLCGAP Pr18 Homo sapiens 8.0 

45 421373 AA808229 Hs,167771 .SSJMPDH^CIMPOHLN.CBS ESTs 8.0 

422634 NM_016010Hs.11B821 SS CGI-62 protein 7 9 

453310 X70697 Hs.553 TM.SNF.SSJM.SNF, solute carrier family 6 (neurotransmitte 7.9 

'435957 N39015 Hs.190368 ,SS,TM ESTs 7.8 

407771 AL13B272 Hs.62713 ,TM,cpn60_TCP1,Sema, ESTs 7,8 

50 443646 AI085198 Hs.164226 ,TSPN,vwc,tspJ,EGF,thiored, ESTs 7.8 

446142 AI754693 Hs.145968 .TM.cadherin.CEdherin CJerm, ESTs 7.7 

444649 AW207523 Hs.197628 ,SS,rrm, ESTs 7.6 

435147 AL133731 Hs.4774 ,TM,SDF,UPAR LY6, Homo sapiens mRNA; cDNA DKFZp761C1712 (f 7.6 

439138 AI742605 Hs.193696 TM ESTs 7,6 

5 5 429220 AW207206 Hs.136319 SS ESTs 7,6 

42B804 AK000713 Hs,193736 .SS.UDPGT hypothetical protein FU20706 7.5 

453511 AL031224 Hs,33102 SS.SS transcription factor AP-2 beta (actiratl 7.5 

439809 R41396 Hs,101774 SS hypothetical protein FLJ23045 7.5 

414B59 AA157291 Hs.2147S SS ubinudein 1 7.5 

60 416276 U410B0 Hs.79136 SS,TM,TM L1V-1 protein, estrogen regulated 7.5 

452862 AW378065 Hs.8687 ,SS,Pep_M12B_propep,Reproiysin,ispJ, ESTs 7.4 

■ 452926 AI742170 Hs.31297 ,SS,TM duodenal cytochrome b 7,4 

453331 AI240665 Hs.8895 ,SS,TM,dislntegrin,Pep_M12B_propep,Repro ESTs 7.3 

420802 U22376 Hs.1334 SS,NA,myb_DNA-binding v-myb avian myeloblastosis viral oncogen 7.3 

65 450603 R43646 Hs,12422 SS ESTs * 7.2 

422867 L32137 Hs.1584 SS,EGF,tspJ,SS,E2F_TDP 1 cartilage oligameric matrix protein (COM 7.2 

418004 U37519 Hs.87539 SS,aldedfi,SS,a!dedh, aldehyde dehydrogenase 3 family, member 7.2 
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4Z6451 
450701 
419519 
410555 
433138 
411558 
409079 
417275 
432731 
442818 
407366 
427427 
410785 
401045 



AI908165 Hs.169946 SS.GATA, 



,TM,dis!ntegrin,Rep 



Hs.70725 
W87707 Hs.82065 
X63578 Hs.295449 
R31178 Hs.287820 
AK001741 Hs,8739 



C1 10018B3*:gi!6753278|ref|NP_033938.1] c 



GATA-bindlng protein 3 fT-cell receptor 
Homo sapiens cDNA FU12280 fis, clone MA 
ESTs 

a dlsintegrin and mstailoproteinase doma 
semaphorin sem2 

gamma-aminobutyric acid (GABA) A recepto 
Inferleufo'n 6 signal transducer (gp130, 
parvalbumin 
fibranecUn 1 

hypothetical praleh FU10879 
gb:Homo sapiens clg33 mRNA, partial sequ 
ESTs 

gb:IL2-iJM0079-09030M50-D03 UM0079 Homo 
lCE_p20,SS,lCE p10,lCE_p2Q,lCE_p10,ICE„p 



SS,TM,SS,TM 
,TM,fo3, 

SS,efhand,SS,ethand,ras 

l SS,fi;3,fn1,i[.2,fn2,fn1 

WD40.SS 

JM.IBR 

SS,leclin_C,SS 



6.7 



418986 AI123555 Hs 81796 


,SS,Repro:ysln,lsp_1, 


ESTs 




442082 R41823 Hs.7413 


,TM,EPH_lbd,pMnase,SAM,fn3, 


ESTs; calsyntenin-2 


67 


442861 AA243837 Hs.57787 


SS 


ESTs 


6.6 


418836 AI655499 Hs 161712 


,TM,Adivin recp,pMnase,death,2U5, 


ESTs 




422080 R20893 Hs.325823 


,SS,TM,CD36 


ESTs, Moderately similar to ALU5JHUMAN A 


6.6 


444381 BE387335 Hs.283713 


,SS,mitoj;Brr 


ESTs, Weakly similarto S64054 hypotheii 




404091 NA 


,TM 1 7trrL3/NF receptor, 


Target Exon 


6.6 


417866 AW067903 Hs 82772 


SS,CDllagen,COLFI,TSPN,SS,TSPN 


collagen, type Xt, alpha 1 




428819 AL135623 Hs.193914 


SS.SS 


KIAA0575 gene product 


65 


410275 U65658 Hs 61796 


,SS,RlbosomaLS4e 


transcription factor AP-2 gamma (acfivat 




425236 AW067800 Hs 155223 


SS 


stannlocalcjji 2 


62 


415669 NM 005025Hs,78589 


,SS,serpin, 


serine (or cysteine) proteinase inhiblto 


62 


416319 AI815601 Hs 79197 


SS,TM,ig,SS,TM 


CD83 antigen (activated B lymphocytes, i 




412140 AA219691 Hs.73625 


,SS,kinesln, 


RAB6 interacting, kinesln-like (rabWnes 


62 


442942 AW1670B7 Hs,131562 


1 SS,ig,Sema,pkinase, 


ESTs 


6.2 


446163 AA026880 Hs. 25252 


,SS,TM,fn3, 


prolactin receptor 


e!i 


443162 T49951 Hs.9029 


filament.SS.fflament.filament 


DKFZP434G032 protein 




409602 W26713 Hs 256972 


,SS,TM,DAGKa,DAGKc, 


ESTs 




428479 Y00272 Hs.184572 


,SS,pklnase,pklnase 


cell division cycle 2, G1 to S and G2 to 


6.1 




; SS 1 TM,pkinase 1 Recep_l_domaln,SH2,PH,FurHER2 receptor tyrosine kinase (c-erb-b2, 




433404 T32982 Hs.1 02720 




ESTs 


fil 


410079 U94362 Hs 58589 


Glyco_tfansf_8,SS 


glycogenln 2 


_'] 




,SS,tilarnent,Pribosyltran,filamerit,Armad 


Target Exon 






SS.adenylatekinase, 


adenylate kinase 5 




402230 NA 


,SS.TM,p450, 


Target Exon 


S.I 


427674 NMJ)03528Hs,2178 


hisione.SS.histone, 


H2B histone family, member Q 


6.1 


428398 AI249368 Hs.98558 


,SS,TM 


ESTs 


6.0 


458098 BE550224 Hs.74170 


SS 


metallothionein 1E (functional) 


6.0 


419968 X04430 Hs.93913 


SS.IL6JL6, 


interteukln 6 (interferon, beta 2) 


6.0 


416636 N32536 Hs.42645 


,SS,TM 


solute carrier family 16(monocarboxyiic 


6.0 


419703 AI793257 Hs.128151 


,SS,zf-C2H2, 


ESTs 


5.8 


4246S7 J05070 Hs.151738 


SS,Peptidas8_MiaFfi2,hemopexin,SS 1 TM,P 


epmairix mefallopraietnase 9 (gelatinase B 


5.8 


449679 AI823951 Hs.129700 


SS 


tolloid-like 1 


5.8 


421296 NM_00266SHs.103253 


SS 


perilipin 


5.8 


442117 AW664964 Hs.126899 


,SS,TM 


ESTs 


5.7 


400303 AA242758 Hs.79136 


,SS,TM 


LIV-1 protein, estrogen regulated 


5.7 


419440 AB020689 Hs.90419 


SS 


KIAA0882 protein 


5.7 


444858 AI199738 Hs.2DS275 




ESTs, Weakly similarto ALUA_HUMAN Mil 


5.7 


432239 X81334 Hs.2936 


SS,Peptidase„M10.liemopexIn,SS,Peptldase_matrix metalloproteinase 13 (collagenase 


5.6 


440705 AA904244 Hs.153205 


TM 


ESTs 


5.6 


400286 NA 


SS,TM,ABC_iran,ABC_membrane,SS 


C1 6000922:gi]7499 103|plr||T20903 hypothe 


5.6 


446466 H3B026 Hs.308 


arrestin.SS 


arrestin 3, retinal (X-arrestin) 


5,5 


423201 NIW_000163Hs.125180 


SSJMMSS 


growth hormone receptor 


5.5 


433043 W57554 Hs.125019 




lymphoid nuclear protein (LAF-4) mRNA 


5.5 


439509 AF0B6332 Hs.58314 


,SS,TM,Syntaxin 


ESTs 


5.4 


425247 NM_005940Hs. 155324 


SS,Peptidase_M10,fiemopexin,SS 


matrix metalloproteinase 11 {MMP11; stro 


5.4 


409757 NMJ01898HS.123114 


,SS,cystalin, 


cystatin SN 


5.4 


425292 NM_005824Hs.155545. 


SS 


37 kDa leucine-rich repeat (LRR) protein 


5,4 


448045 AJ297436 Hs.20166 


,SSJM 


prostate stem ceil antigen 


5.4 


452681 AF153330 Hs.30246 


,SS,TM 


solute carrier famiiy 19 (thiamine trans 


5.3 


452243 AL355715 Hs,28555 


SS 


programmed cell death 9 


5.3 


439310 AF0B6120 Hs.102793 


,SS,TM,UDPGT,caseinJ<appa 


ESTs 


5.2 
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441111 AI806B67 Hs.126594 ,SS,TM,Phosphodiest, ESTs 5.2 

452355 N54926 Hs.29202 TM,7tm_1,TM G protein-coupled receptor 34 5.2 

427711 M31659 Hs. 130408 SS solute carrier family 25 (mitochondrial 5.2 

418636 AW749B55 ,SS,TM,HECT gb:QV4-BT0534-281299-053-c05 BT0534 Homo 5.2 

429353 AL1 17406 Hs.200102 .SS,TM,ABC_tran,ABC_membrane, ATP-bindlng cassette transporter MRP8 5.1 

441690 R81733 Hs.33106 ,SS,HECT,zf-UBR1,PABP, 14-3-3, ESTs 5.1 

430447 W17064 Hs.332848 SS SWI/SNF related, matrix associated, acii 5.1 

429698 AI685086 Hs,26339 ,SS,ras, ESTs, Weakly similar to S2134B probable 5.1 
425325 X52730 Hs.1892 SS,NNMT^PNMT_TEMT,SS,MNMT_PMMT_TEMT,STAR 

phenylethanolamlne N-melhyltransferase 5,1 

423600 AI633559 Hs.310359 SS ESTs 5.1 

414737 AI160386 Hs.125087 SS ESTs 6.1 

403593 MA ,CIDE-N,pf(lnase Target Exon 5.1 

407758 D50915 Hs.38365 SS.SS KSAA01 25 gene product 5.0 

445234 AW137636 Hs.146059 ,SS,TM ESTs 5.0 

411165 NMJ00169Hs,69089 SS,Melibiase ) BTKPH l pklnase,SH2,SH3,Ribo gafactosidase, alpha 4.9 

420833 NM_014581Hs.27448O SS,llpocalin,SS,lipocalin odorant-binding protein 2A 4.9 

414117 W88559 Hs.1787 ,TM,lonJrans,KJetra, proteollpld protein 1 (Pellzaeus-Marzbac 4.9 

416783 AA206186 Hs.79389 SS.TMJM monocyte to macrophage differentiation-a 4.9 

401093 TM, LRRCTJM.LRRCT, C12D0D5B6*:gil6330167jdbJ|BAAB6477.1| (A 4.9 

411096 U80034 Hs.68563 Peptidase^, mitochondrial intermsdiste peptidase 4.9 

457411 AW085961 Hs, 130093 SS ESTs 4.9 

436007 AI247716 Hs.232188 .SS.adMnc, ESTs 4.9 

450506 NM.004460Hs.418 SS,DPPlV_N_term,Peptidase_S9,SS,DPPIV NJbroblast activation protein, alpha 4.9 

417975 AA641833 Hs.30DS5 ,SS,trypsin hypotheb'calproteil FLJ231B6 4.9 

421072 AI215069 Hs.891f3 SS ESTs 4.8 

427032 AF012023 Hs. 173274 ,SS,14-3-3 integrin cytoplasmic domain-associated p 4.8 

447752 M73700 Hs.105938 SS.transferrin,7tm_1 .transferrin, lactotransferrin 4.8 

403199 NA SS,TM,Folate_cafrier,SS,TM,Folate_carfie NM_025243*:Homo sapiens solute carrier f 4.8 

427122 AW057736 Hs.323910 ,SSJM,pkinase,RecepJJomain,SH2,PH,FurHER2 receptor tyrosine kinase (c-erb-b2, 4.8 

445900 AF070526 Hs.13429 ,SS,Ca_channel_B, Homo sapiens clone 24767 mRNA sequence 4.7 

413048 M93221 Hs.75182 SS,™,lec^c,fn2,Rioir L BJedin I SS,TM mannose receptor, C type 1 4.7 

419563 AA526235 Hs.193162 SS Homo sapiens cDNA FLJ1 1983 lis, clone HE 4.7 

442432 BE093589 Hs.38178 SS hypothetical protein FU23468 4.6 

452093 AA447453 Hs.27860 ,SS,TM,7tm_1. Homo sapiens mRNA; cDNA DKFZp585M0723 (f 4.6 

442323 AW016669 Hs.29f90 ,SS,TM r CBS,vol(age_CLC ESTs 4.6 

45D6DB AI568605 Hs.6O380 ,TM,G!yco_hydro_1 ESTs, Moderately similar to ALUB_HUMAN A 4.B 

435542 AA6B7376 Hs.269533 ,SS,pkinase,RhoGEF,lg,PH,SH3, ESTs ' 4.6 

417576 AA339449 Hs.82265 AiRS.fomvLtosf.GARS.SS.GARS.AlRS.forpliosphoiibosyiclvcinamidBformyltransfer 4.6 

446089 A1860021 Hs.270651 .pkinaso ESTs, Moderately similar to A47582 B-eeJ 4,6 

445413 AA151342 Hs.12677 SS,UPF0099,SS,UPFQD99, CG1-147 protein 4.6 

424420 BE614743 Hs.14B688 ,SS,TM,MAPEG, prostaglandin E synthase 4.5' 

432378 AI493046 Hs.146133 ,SS,TM,UDPGT ESTs 4.5 

452190 H26735 Hs.91668 ,SS,TM 1 PH,SH2,Furin-like,pkinase,Recep L Homo sapiens done PP1498 unknown mRNA 4.5 

434674 AA831879 Hs.1 36985 ,SS,Histdeacetyl, ESTs 4.5 

419986 AI345455 HsJ8915 pkinase,OPR, GA-binding protein transcription factor, 4.5 

421582 AI910275 Hs.1406 SS,trefoil,SS,TM,ldl_recepta,SRCR,lryps trefoil factor 1 (pS2) 4.5 

410361 BE3916D4 Hs.62661 SS.TWI.GBP.TM.GBP guanylate binding protein 1, Interferon- 4.5 

426327 W03242 Hs.44898 SS Homo sapiens clone TCCCTA00151 mRNA sequ 4.5 ■ 

406639 M97711 SS,SS,ig, gb;Human T-cell receptor (V beta 18.1, J 4.5 

452834 AI638627 Hs.105685 ,SS,DEAD,Fork_head KIAA168B protein 4.5 

427316 AA179949 Hs.175563 SS Homo sapiens mRNA; cDNA DKFZp564N0763 (f 4.4 

446733 AA8S3360 Hs,26040 ,SS,TM,p450, ESTs, Weakly similar to fatty acid omega 4.4 

442118 AA976718 Hs.202242 ,ig,Sema, ESTs 4.4 

421524 AA312082 Hs,105445 SS GDNF family receptor alpha 1 4,4 

453060 AW294092 Hs.21594 ,SS,ras,Y_pbosphatase,ras hypotheScal protein MGC15754 4,4 

453403 BE466639 Hs.61779 ,SS,HMG_box,1ilament, Homo sapiens cDNA FU13591 fis, clone PL 4,4 

444301 AK000136 Hs.10760 SS.LRR.SS asporln(LRR class 1) 4,4 

453619 HB7648 Hs.33922 SS Homo sapiens, clone MGC:9084, mRNA, comp 4,3 

432656 NIVL000246HS.3076 SS.LRR, MHC class II transacJivator 4.3 

426384 AI472078 Hs.303682 ,SS,ArfGap, ESTs 4.3 

431701 AW935490 Hs.14658 ,SS,BlR Human chromosome 5q13.1 clone 5G8 mRNA 4.3 

416931 D45371 Hs.60485 SS.CIq.Collagen.SS.CIq, ad/pose most abundant gene transcript 1 4.3 

420854 AW296927 ,SS,TM,Feptidase_M1, gb:UNH-BW0-ajc-c-07-O-Ul.s1 NCI_CGAP_Su 4.3 

416867 D31771 Hs.89404 SS,homeobox,riomeobox ( msh (Drosophila) homeo box homolog 2 4.3 

443514 BE464288 Hs.141937 ,SS,TM,MIP, ESTs 4,3 

447499 AW262580 Hs.147674 .SSJM.cadherin.cadherin protocadherin beta 16 4.3 
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441560 F13386 Hs.7888 
409064 AA062954 Hs, 141 883 
422667 H25642 Hs.133471 
454032 W31790 Hs.194293 
432663 AI984317 Hs.1 22589 
401747 

432882 NM_013257Hs.279696 
437036 AI571514 Hs.133022 
447754 AW073310 Hs.163533 
443194 AI9S496B Hs.279009 
451871 AI821D05 Hs.118599 
457211 AW972565 Hs.32399 
421565 NM.000399Hs.1395 
431657 A1345227 Hs.105448 
427899 AA829286 Hs.332053 
444779 AI192105 Hs.147170 
442295 AI827248 Hs.224398 
436396 AI6B3487 Hs.152213 
446039 AI150491 Hs.90756 
422938 NMJ)01809Hs,1594 
406922 S70284 
439285 AL133916 Hs.172572 
424800 AL035588 Hs.153203 
429922 Z97630 Hs.226117 
447178 AW594641 Hs.192417 
409038 T97490 Hs.50002 
452747 BE1 53855 Hs,61460 
420139 NM_005357Hs.95351 
408877 AA479033 Hs.130315 
403329 NA 

439926 AW014875 Hs.137007 
430832 AI073913 Hs.100686 
432481 AW451645 Hs.151504 
452410 AL133619 Hs.29363 
418661 NMJ01949HS.1189 
431958 .X63629 Hs.2877 
425071 NfiA_013989Hs.1 54424 
447197 R36075 
428722 U76456 Hs.190787 
428330 122S24 Hs,2256 
423242 ALO39402 Hs.125783 
449048 Z45051 Hs.22920 
414831 M31158 Hs.77439 
413589 AW452631 Hs.313803 
408875 NM_015434Hs.48604 
418629 BE247550 Hs.86859 
450787 AB008190 Hs.25475 
414870 N72264 Hs.300670 
450325 AI935962 HS.262B9 
407633 NM.007069Hs.371 89 
426172 AA371307 Hs.125056 
442262 BE1 70651 Hs.87G0 
427961 AW293165 Hs.143134 
445563 AW8736QS Hs.149006 
403943 

408761 AA057264 Hs.238936 
423279 AW959861 Hs.290943 
420440 NM 002407HS.97644 
445107 AI203121 Hs.147313 
428303 AW974476 Hs. 183601 
411667 BE16Q198 
427809 M263B0 Hs.1 80878 
418203 X54942 Hs.83758 
430376 AW2920E3 Hs.12532 
444190 AI878918 Hs.10526 
433495 AW373784 Hs.71 
429838 AI916662 Hs.211577 



.pklnase, 

,SS,CUB, 

,S5,TM,FM0-llke 

r SS,TM 

TM 

.SS.filamenUilament 

pkinase,pkinase_C, 

,SS,TM 

,pk1nase, 

,SS,TM 

,SS,GDNF, 

WH1.WH1 

zf-C2H2,SS 

,SS,TM,pkinase 



Homo sapiens clone 23736 mRNA sequence 

ESTs 

ESTs 

ESTs, Weakly similar jo 154374 gane.NF2 
ESTs 

Homo sapiens keratin 17 (KRT17) 
sewrn/glucocortlcoid regulated kinase-li 
ESTs 

Homo sapiens cDNA FU14142 iis, clone MA 

matrix Gla protein 

ESTs 

ESTs, Weakly similar to S51797 vasodllal 
early growth response 2 (Krox-20 (Drasop 
Ts, Weakly similar to B34087 hypothetl 



,COLFI,vwc,Collagen, 
.wnt, 

,TM,Glyca_hydra_1 

,SSJM,thlolase, 

SS,TM,Desaturase,SS 

,SS,ig,pklnase,LRRNT ( LRRCT, 

HLH.SS 

,SS,TM,linker_histone,7tmJ 

,SS,TM 

SS,IL8,SS,IL.8 

,SS,HLH 

,SS,TM,p450, 

,SS,TM 



Homo sapiens cDNA FU11469 fis, clone HE 
wingless-type MMTV integration site faml 
ESTs 

centromere protein A (17kD) 
gb:stearoyl-CoA desalurase [human, adipo 
hypothetical protein FU20093 
MyoD family inhibitor 
HI hlstone family, mamberO 
ESTs 

small inducible cytokine subfamily A (Cy 
Ig superfamily receptor LNJR 
lipase, hormone-sensitive 
ESTs, Weakly similar to A47582 B-cell gr 
Target Exon 
ESTs 

ESTs, Weakly similar to JE0350 Anterior 
Homo sapiens cDNA FLJ11973 fis, Clone HE 
Homo sapiens mRNA; cDNA DKFZp434E2321 (f 
bh E2F transcription factor 3 

SS,TM,Cadherin_C_term,cadherin,SS,TM,cadcadherin3, type 1, P-cadherin (placenta 
SS,T4_delodinase 1 T4_delodinase, debdinase, iodothyronlne, type It 

,TM,SDF, gb:yh88b01 s1 Soares placenta Nb2HP Homo 

.SS.TIMP, (issue inhibitor of meta'ioproteinase 4 

SS,PeptidaseJVI1Q,SSPeptidase_M10,hemopematrlx metalloproteinase 7 (MMP7; uterin 
SS DEME-6 protein 

SS.SSJM similar to S68401 (cattle) glucose indue 

,SS,cNMPJ?;ndlng,Ri;a,HMG_box protein kinase, cAMP-dependent, regulate 

,SS,abhydrolase ESTs, Highly similar to AF1 57833 1 noncl 

SS DKFZP434Bi6B protein 

,SH2,PH,SS,TM,PH 1 SH2 l Furin-like,pkinas growth factor receptor-bound protein 7 



SS,TM,M!P,SS,TM r iMSP, 
SS 

TM.TM 
.SS.DENN 
,SS,START, 

,SS,WH1,WH1 
p450,SS,p450 
.SSJMJtmJ. 



,SS,SRCR,U! 
,SS,TM 

SS,RGS,RGS,RGS 



SS,lg,MHCJ,connexin,SCAN,SS,TM 



aquaporin 7 
KIAA1 204 protein 
ESTs 

similar to rat HREV1 07 
ESTs 

deleted in liver cancer 1 

ESTs 

ESTs 

C5000355:gl|4503225|rei|NP_000765.1 1 cyt 
ESTs, Weakly similar to (defline not ava 
ESTs 

mammaglobin 2 (MGB2; mammaglobin B; lip 
ESTs, Weakly similar to I3S022 hypotheli 
regulator of G-protein signalling 16 
gb:QV1-HT0413-010200-059-hQ3 HT0413 Homo 
lipoprotein lipase 
CDC28 protein kinase 2 
chromosome 1 open reading frame 21 . 
cysteine and giyc'ne-riGh protein 2 
alpha-2-glycoproteln 1, zinc 
kinectln 1 (kinesln receptor) 
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454071 AI041793 Hs.42502 ,TM,7tmJ, ESTs 3.7 

451659 H44491 Hs.252938 ,SS,TM,EGF 1 kJI recepUldL/ecepUEGF ESTs, Weakly similar So ALU1_HUMAN ALU S 3.7 

420281 AI623693 Hs.1 91533 ,SS,AAA, ESTs 3.7 

427691 AW194426 Hs.20726 ,SS,Glycos_transf_2, ESTs 3.7 

5 428824 W23624 Hs.173059 S3 ESTs 3.7 

424676 Y08565 Hs.151678 Glycosjransf 2,Ricin_BJertln,SS,Glycos UDP-N-acetyl-alpha-D^alado5amirie:polyp 3.7 

418026 BE379727 Hs.83213 llporalln.SS.IiporallnJipocalin.ferriti fatty acid binding protein 4, adipocyte 3.7 

457465 AW301344 Hs.122908 ,SS,Pribosyltran,Sulfatase DNA replication factor 3.7 

417601 NM_014735Hs.82292 PHD,pklnase,SS KIAA0215 gene product 3.7 

10 407999 AI126271 tfs.49433 SS ESTs, Weakly similar to YZ28 HUMAN HYPOT 3.7 

425548 AA89D023 Hs.1905 SS,TM,fn3,SS,™,fn3 > prolactin receptor 3.7 

446619 AU076643 Hs.313 ,SS,TM,efhand,ion_trans secreted phosphoproteln 1 (osteopontin, 3.7 

411213 AA676939 Hs.69235 SS,TM,CUB,F5 F6 type_C,MAM,SS,TM,CUB t F5_neuropilin 1 3.6 

406625 Y13647 Hs.119597 SS,TM,Oesaturase,SS stearoyi-CoA desaturase (deIfa-9-desatiif 3.6 

15 417511 AL049176 Hs.B2223 SS chordin-like 3.6 

428769 AW2D7175 Hs.106771 ,SS,7tm_1,SPRY, ESTs 3.6 

407137 T97307 ■ ,SS,TM,GQA1 CD39 gb:ye53h05.s1 Scares fetal liver spleen 3.6 

401866 ,SS,filament, Target Eicon 3.6 

451195 U10492 Hs.438 SS,homebbox,E1s,SS,homeobox, mesenchyme homeo box 1 3.6 

20 426044 . AA502430 Hs.336695 SS ESTs 3.6 

426310 NM_0QQ909Hs.1 69266 SS,TM,7tmJ, neuropeptide Y receptor Y1 3.6 

440029 AW089705 Hs.293711 SS ESTs, Weakly similar to S64329 probable 3.6 

408573 AA284775 Hs.43148 ,SS,TM,PMP22_Claudin, ESTs 3.6 

431830 Y16645 Hs.271387 .SSJM.U small inducible cytokine subfamily A (Cy 3.6 

25 444781 NMJ1440OHS.11950 ,SS,PH,!adamase_B GPI-anchored metastasis-associated prate 3.6 

431493 AI791493 Hs.129873 .SS^O^SO ESTs, Weakly similar to A36036 cytochrom 3.6 

414175 AI308876 Hs.103849 ,TM,hem3p2xin,Peptidase_M10,hemopexin l Petiypothetlcal protein DKFZp761D1 12 3.6 

4117B9 AF245505 Hs.72157 Ig.LRRCT, DKFZP564I1 922 protein 3.6 

418851 AI417828 Hs.192435 ,SS,TM ESTs 3,5 

3 0 453968 AA847843 Hs.62711 ,SS,HMG_box, Homo sapiens, clone I MAGE:3351 295, mRNA 3.5 

407104 S57296 Hs.323910 ,SS,TM,SH2 l PH,pkinase,Recep_L„domain,Furv-erb-b2 avian erythroblastic leukemia v 3.5 

449051 AW961400 Hs.333526 SS HER2 receptor tyrosine kinase (c-erb-b2, 3.5 

434398 AA121098 Hs.3838 pkinase,POLO_box,SS,pkinase,POLO box, serum-inducible kinase 3,5 

454042 H22570 Hs. 172572 .SSJg.pkfnase.LRRNT.LRRCT, hypoffietol protein FLJ20093 3.5 

35 459496 AA80S940 Hs.274450 ,SS,TM,KRAB,SCAN,zf-C2H2,ig EST 3.5 

414998 . NMJ02543HS.77729 ,SS,TM oxidised low density lipoprotein (lectin 3.5 

407756 AA1 16021 Hs.38260 SS,UCH-1,UCH-2,SS,TM,G_glu_transpept ubiquitin specific protease 18 3.5 

- 4421D1 AI651930 Hs.135684 SS ESTs 3.5 

449722 BE230074 Hs.23960 cyclln.SS.TM.cydin, cyclin B1 3.5 

40 452554 AW452434 Hs.58006 SS ESTs, Weakly similar to ALU5_HUMAN ALU S 3.5 

421991 NM_014918Hs. 110488 SS KIAA0990 protein 3.4 

420058 AK001423 Hs.94694 SS Homo sapiens cDNA FLJ105S1 fis, clone NT 3.4 

425776 U25128 Hs,159499 SS,TM,7tm_2,SS,TM,7fm_2 parathyroid hormone receptor 2 3.4 

407846 AA426202 Hs.40403 ,TM,ABC_membrane,ABC_tran,RibosomaLS4eCbpyp300-interading transactivator, wit 3.4 

45 406925 L34041 Hs.9739 ,SS,TM,transportj)rot,SWIB,RhoGAP,DAG_PEglyceral-3-pho3phate dehydrogenase 1 (so 3.4 

445873 AA250970 Hs.251946 ,SS,rrm,PABP,pkinase,14-3^,rrm poly(A)-bindsng protein, cytoplasmic 1-i 3.4 

413054 NMJ02318HS.83354 ,SS,TM,mitojKn-,LysyLoxidase lysyloxldase-like 2 3.4 

414921 BE390551 Hs.77628 SS,START,SS,START,NNMT_PNMT_TEMT, steroidogenic acute regulatory protein r 3.4 

452268 NMJ03512HS.23777 SS.histcne.Calc.CGRP IAPP,ig,MHC l,SPRY,H2A histone family, member L 3.4 

50 428862 NM_000346Hs.2316 SS.HMGJbox, SRY (sex determining region Y)-box 9 (ca 3.4 

412520 AA442324 Hs.795 histone,SS,histone,BolA H2Ahistona family, member O 3.4 

410530 M25609 Hs,64173 ATP-syni_ab,SS,7tmJ.ATP-synLab ATPase, H transporting, lysosomal (vacuo 3.4 

4017BO filament.SS.filament.filamem' NJL005557':Homo sapiens keratin 16 (foca 3.4 

447131 NIVL0045B5Bs.17466 TM retinolc add receptor responder (tazaro 3.4 

55 41B334 AA319233 Hs,5521 ,SS,TM,Ribosomal L27e, ESTs 3.4 

415138 C18355 Hs.295944 . ,Kunltz_BPTI, tissue factor pathway Inhibitor 2 3.4 

421168 AF182277 Hs.330780 SS,p450,SS cytochrome P450, subfamily JIB (phenobar 3.4 

431473 AA825686 Hs.321176 SS ESTs, Weakly similar to S65824 reverse t 3.4 

421379 Y15221 Hs.103982 SSJL8, small inducible cytokine subfamily B (Cy 3.4 

60 411984 NMJ05419HS.72988 SH2,STAT,SS,STAT signal transducer and activator offrans 3.4 

408101 AW9685M Hs.123073 .pklnase, CDC2-related protein kinase 7 3.4 

405366 RhDGEF,PH,SS,RhoGEF,PH, MM.003371*;Homo sapiens vav 2 oncogene ( 3.4 

414612 BE274552 Hs.76578 SAP,SS,FG-GAP,vwa protein inhibitor of activated STAT3 3.4 

411393 AW797437 Hs.69771 SS,sushi ; trypsln,vwa,rrm,fibrinogen_C,fn B-factor, properdin 3.3 

65 435767 H73505 Hs.117874 ,SS,Peptidase_S8,P,Peptldase_Sa,P ESTs 3.3 

416405 D36961 Hs.79299 ,SS,TM lipoma HMGIC fusion partner-like 2 3.3 

433068 NM_006456Hs.2B8215 .SS.Pribosyltran, sialyltransferase ' 3.3 
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445462 AA378776 Hs.288649 




hypothetical protein MGC3077 




439452 AA918317 Hs.57987 




B-cell CLL/lymphoma 11B (zinc finder pro 




452017 AF1 09302 Hs.27495 




prostate cancer associated protein 7 




409099 AK000725 Hs,50579 


cc 


hypothetical protein FLJ20718 




452106 AI141031 Hs.21342 


SS 






447519 U46258 Hs.339665 








42B928 AF037062 Hs,172914 


,SS,adn_siion l TGF-beta 1 TGFb_propeptida 


retino! dehydrogenase 5 {11-cis and 9-cl 




438825 BE327427 Hs.79953 


( SS,TM,hislDne,ANF_receptor,guanylate_Gy 


F. STS or- 




414575 H11257 Hs.22968 


,SS,pkinase,lg, 

SS,TGFb_propeptlde,TGF-beta,SS 


Horn) sapiens clone IMAGE:451939, mRNA se 




417837 AL079905 Hfl.1103 


transforming growth factor, beta 1 




422128 AW881145 




gb'QV0-OT0033-01040D-182-a07 OT0033 Homo 




445941 A1267371 Hs. 172636 


SS.SSJipoxygeaase.PLAT 






429973 AE423317 Hs. 164680 


,SS,T-box,UDPGT 








SS SS Peptidase M1 EGF ig lectin c sushi 


r t'rf 




459561 AI547306 Hs. 134981 


S ?. 






425741 AF052152 Hs, 159412 


,pKinas8, 


Homo sapiens clone 24528 mRNA sequence 












456508 AA502764 Hs. 123469 




ESTs, Weakly similar to AF2UB855 1 BM-01 




434228 Z42047 Hs.263978 


,SS,TM,7tm_l 


Homo sapiens PR02751 mRNA, complete cds 




415752 BE314524 Hs.7B776 




putative transmembrane protein 




400419 AF084545 


,SS / Pept,dase_M'!, 






439750 AL359053 Hs.57664 • 


,TM,intefl,rin_B,RicinJ3Jectln,rrm 


Homo sapiens mRNA full length Insert cDN 




423658 AL137326 Hs. 133483 




Homo sapiens mRNA; cDNA DKFZp434BQ650 (f 




428514 AW23G861 Hs.193139 


,SS,START,NNMT_PNMT_TEMT, 




3.3 


42BS98 AAB52773 Hs,334838 


SS 


KIAA1866 protein 


3.3 


448986 Y09763 Hs.22785 


SS,TM,TM 


gamma-aminobutyric acid (GABA) A recepto 




432072 N62937 Hs, 2691 09 


,Sema,ig, 






417433 BE270266 Hs.82128 


SS r TM,LRRCT,LRRNT,LRR,TM,LRRCT, 


5T4 oncofetal trophoclast glycoprotein 




452194 AI694413 Hs.332649 


iSS,TM, 71(1^3, AWF_rEcepior,sushi 


olfactory receptor, family 2, subfamily 




444051 N48373 Hs.10247 


,SS,lg, 


activated leucocyte ce!i adhesion mobcu 




420042 AW015140 Hs. 161723 


,SS,CUB, 


ESTs 




457292 A1921270 Hs.334882 


SS,TM,SS,TM,G-patcti 


hypothetical protein FD14251 




421458 NM_003654Hs,1 04576 


SS 


carbohydrate (keratan sulfate Gal-6) sul 




431104 AW970B59 Hs. 313503 


,Sema,rg, 


ESTs 




443767 BE562136 Hs.9736 


1 SS,PCI,RasGEF,hormonB_rec,zf-C4, 


proteasome (prosome, macropaln) 26S subu 


3.2 


419589 AW9737Q8 Hs.201925 


,FGF, 


Homo sapiens cDNA FLJ 13446 fis, clone PL 




415447 Z97171 Hs,78454 


SS,OLF,OLF,OLF,RibosaniaLL4 


myocllln, trabecular meshwork inducible 


™ 


443464 BE548446 Hs.5167 


SS,TRSSF,SS,TM 


Homo sapiens mRNA; cDNA DKFZp434F152 (fr 


*~ 


423431 AA326062 


,SS,p450,p450 


gb;EST29171 Cerebellum II Homo sapiens c 




41327B BE5530B5 Hs.833 


,SS,TM,ubiquitin,lamlnin GJaminln EGF.k 


Interferon-stlmutaied protein, 15 kDa 




458451 AW297181 Hs 195922 


,SS,Rlbosomal L14 


ESTs 




440449 AA885430 Hs 201925 


,FGF, 


Homo sapiens cDNA FLJ13446 fis, clone PL 






SS,laminin_EGFJamIninJ\lterm,adh_sriorl,S laminin, beta 3 fnlcein (125kD), kaiinln 




434876 AF1 60477 HS.61460 


,SS,HLH 


lg superfamiiy receptor LNIR 




435575 AF213457 Hs.44234 


SS.lg.SS.TM 


triggering receptor expressed on myeloid 




415773 R21651 Hs.324725 


,SS,TM r Rib33orna|_S3Ae,G-gamma 
SS 


ESTs, Moderately similar to A47582 B-cel 






KIAA1681 protein 




450847 NM.003155HS.25590 


.SS.hotr.eobox, 


stanniocalcin 1 




426075 AW513691 Hs. 270149 


,SS,ff!3, 


ESTs, Weakly similar to 21 09260A B cell 




452110 T47667 Hs.28005 


,SSJM,Activin_recp,pk!nase 


Homo sapiens cDNA FLJ113D9 fis, clone PL 


I 


439963 AW247529 Hs.6793 


JM,p450,Ets 


platelet-activating factor acefylhydrola 




402837 NA 


SS 


ENSPO0O0D241312*:DJ947L8.1.8 (novei Sush 


15 


439451 AF086270 Hs.278554 


,SS,Chroma_shadow ) chromo, 


heterochrornatin-llke protein 1 


11 


406664 L34041 Hs.9739 


,SS,TM,tfansporLprot,SWIB,RhoGAP,DAG„PEgVcerol-3-phosphate dehydrogenase 1 (sd 


31 


417315 AI060042 Hs.336901 


,SS 1 RNA_pD!_A,RMA_poLA2,Ribo3DmaLS24e,ribosoma[ protein S24 


3.1 


413011 AW06B115 Hs.821 


SS,LRR,LRRNT,SS,LRRNT,LRR, 


blglycan 


3.1 


414987 AA524394 Hs.294022 


,SS,connexin 1 bormone_rec,zf-C4,cDnnexin 


hypothetical protein FLJ14950 


3.1 


429197 H24471 Hs.26930 


,SS,Gelsolin, 


ESTs, Weakly similar to T20272 hypothec 


3.1 


448030 N30714 Hs.325960 


iSS.TM 


membrane-spanning 4-damalns, subfamily A 


3,1 


407604 AW191962 Hs,249239 


,SS,TM,C1q, 


collagen, type VIII, alpha 2 


3.1 


419092 J05581 Hs.89603 


SS,TM,SEA, 


mucin 1, transmembrane 


3.1 


456672 AK002016 Hs;1 14727 


,SS,PK,PK 


Homo sapiens, clone MGC:15327, mRNA, com 


3.1 


443171 BE2B112B Hs.903O 


SSJM,7tmJ,rmi,SS 


TONDU 


3.1 


452256 AKO0Q933 Hs.28661 


,TM,GDl,7tm_1, 


Homo sapiens cDNA FLJ10071 fis, clone HE 


3.1 


432201 AI538613 Hs.29B241 


SS,TM,lrypsin ) SSJM t trefoil,trypsin,tref 


Transmembrane protease, serine 3 


3.1 


406642 AJ245210 


SS 


gb:Homo sapiens mRNA for immunoglobulin 


3.1 


400903 NA 


SS 


Target Exon 


3.1 
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434408 AIQ31771 Hs.132586 ,SS,Glyco_hydro_2 ESTs 3.1 

452994 AW962597 Hs.31 305 SS,WD40,SS,WD40, KfAA1547 protein 3.1 

445903 AI347487 Hs.132781 fn3,SS,TM,EF1BD class t cytokine receptor 3.1 

424364 AW383226 Hs.201 189 SS ESTs, Weakly similar to G017S3 airophin- 3.1 

410196 A1936442 Hs.59838 UBACT_repeatSS,UBACT„repeai,ThiFJamilyhypDthefical protein FLJ10B0B 3.1 

419150 T29618 Hs.89640 .TMpklnaseM TEK tyrosine kinase, endothelial (venous 3.1 

433417 AA567773 Hs.8859 ,SS,SRCR, Homo sapiens, Similar to RIKENcDN A 5830 3.1 

418624 AI7340B0 Hs. 104211 .Semajg, ESTs 3.1 

436291 BE568452 Hs.5101 .SS.abhydrolase, protein regulator of cytokinesis 1 3.1 

411000 N40449 Hs.201619 SS ESTs, Weakly similar to S33383 SEB4B pro 3.1 

412519 M196241 Hs.73960 SS,Troponin,HemaggIulinln,SS,TM,C2,Tropo troponin T1,skeleiaJ, slow 3.1 

450223 AA418204 Hs.241493 .SS.proJsomerase, natural (cllleMumor recognition sequenc 3.1 

422790 M309875 Hs.25933 ,TM,hlstone,Sec1,hlstone,sugar tr ESTs 3.1 

424269 AW137691 Hs.199754 ,SS,TM,7f/rL2,GPS ESTs 3.1 

435854 AJ278120 Hs.4995 ,SS,WD40 putative ankyrin-repeat containing prote 3,1 

447388 AW630534 Hs.76277 ,SS,TM,nm,oxldored_q6,oxldored_q6 Homo sapiens, clone MGC;9381, mRNA, comp 3,1 

451631 R0B866 SS gb:ye79c02.s1 Scares fetal liver spleen 3,0 

448105 AW591433 Hs.298241 .SSJM.trefoil.trypsin.trefoii Transmembrane protease, serine 3 3.0 

438637 BE50Q941 Hs.126730 ,TM,PH, ESTs, Weakly similar to KIAA1 21 4 protein 3.0 

423024 AA593731 Hs.325823 .SSJM.CD36 ESTs, Moderately similar to AL.U5JHUMAN A 3.0 

456592 R91600 ,SS,Ran_BP1,UM,Ran_BP1 l GRIP,TPR,LIM gb:yq10c02,r1 Soares fetal fiver spleen 3.0 

425920 AL049977 Hs. 162209 SS ) TM,SS,Tfy1 l PMP22_Claudin,PMP22 Claudinclaudin 8 3.0 

444670 H58373 Hs.332938 ,SS,TM hypothetical protein MGC5370 3.0 

401197 art^rf, ENSP0Q000229263*;HSPC213. 3.0 

437755 AW204256 Hs.291687 ,wnt, ESTs 3.0 

452560 BE0770B4 Hs.336432 ,SS,nrm,zf-RanBP,pkina5e,C2,pkinase_C,DA ESTs 3.0 

410274 AA381807 Hs.61762 SS.SS hypoxia-Inducible protein 2 3.0 

450098 W27249 Hs.8109 SS hypothetical protein FLJ21080 3.0 

404826 ,SS,TM Target Exon 3.0 

458389 H70284 Hs.160152 ,SS,RA ESTs, Weakly similar to FPHU alpha-fetop 3.0 

408196 AL034548 Hs.43627 HMGJrox.pWnase.zf-CCHC.SSJHHMGJjox.SRY (sex determining region Y)-box 22 3.0 

433675 AW977653 Hs.7531 9 .SS.ribonuc^retLsm, ribonucleotide reductase W polypeptide 3.0 

418848 AI820961 Hs.193465 ,death,2U5,pkinase,Activin_recp, ESTs 3,0 

422095 AIB6B872 Hs,282804 SS.Cu-oxidass.SS.Cu-oxIdase.Cu-oxIdase hypothetical protein FLJ22704 3.0 

415992 C05837 Hs.145807 ,SS,TM hypothetical protein FLJ1 3593 3.0 

424631 AA688021 Hs.179808 SS ESTs 3.0 

409956 AW103364 Hs.727 SS,TGF-beta,TGFb_propeptide 1 SS,TGF-beta,inhibin, beta A (activin A, actrvln AB a 3.0 

419667 AU077005 Hs.92208 SS,disintegrin,Reprolysin,PepJ/l12B_prope a disintegrin and metalloproteinase dcma 3.0 

450946 AA374569 Hs.1 27698 SS ESTs, Moderately sim'larto 2109260A B c 3.0 

447770 AB032417 Hs.19545 Frizzled,Fz,SS 1 TM 1 Frizzled,Fz, frizzled (Drosophila) homolog 4 3,0 

423826 U20325 Hs.1707 SS.SS cocaine- and amphetamine-regulated trans 3.0 

418838 AW385224 Hs.35198 .SS,TM,Phosphodiest, ectonircleotidepyrophosphatasefphosphodi 3.G 

442804 AW300118 Hs.131257 ,SS,TM,G-gamma ESTs 3.0 

432284 AA532807 Hs.105822 .SSJM.pkinase, ESTs 3.0 
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TABLE 20A 

Table 20A shows the accession numbers for those pkeys lacking unigeneTD's for Table 20. 
5 For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mKNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 

Genbank accession numbers 



Pkey 


CAT number 


Accessions 


410785 


1221055.1 


AW803341 AW803265 AWB03403 AW8034GB AW803402 AW803413 AW803268 AW80339S AW803334 AW803355 


411667 


1253334 1 


Bt1 601 98 AW935398 T1 1520 AW935930 AW856073 AW861 034 


416636 


177402J 


AW749855 AA225995 AW750208 AW750206 


420854 


197072J 


AW296927 AI634514 AI2B316B AA281079 


422128 


211994J 


AW881 145 AA49071 8 MB5637 M304575 T06067 AA331 991 


423431 


228162 1 


M326062 AA325758 AW962182 


423945 


233566 1 


AA410943 AW948953 AA334202 M332882 


426878 


273265 1 


BE069341 AW7484Q3 AL044891 A1908240 AA393080 


433687 


37306U 


AA743991 AA604S52 AW272737 


447197 


711623J 


R36075 AI366546 R361B7 


451631 


878098 1 


R0066B R01523 A1805815 


45S207 


165078 -1 


AA1 93450 


456592 


202684J 


R91600T87079AA291455 
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TABLE 20B 



Table 20B shows the genomic positioning for those plceys lacking unigene ID's and 
accession numbers in Table 20. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Pkey: Unique number corresponding to an Eos prabeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I. et al." refers to the publication 

entiUed 'The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1939) 402:489495. 
Strand: I ndicates DNA strand from which exons were predicted. 

Ntpositfart: Indicates nucleotide positions of pnsdicied exons. 



Pkey 


Ref 


Strand 


400608 


9887666 


Minus 


400903 


2911732 


Plus 


401045 


8117619 


Plus 


401093 


8516137 


Minus 


401197 


9719705 


Plus 


401747 


9789672 


Minus 


401760 


7249190 




401761 


7249190 


Minus 


401866 


6018106 


Plus 


402230 


9966312 


Minus 


402406 
402578 


9796239 
9884928 


Minus 
Plus 


402606 


9909429 


Minus 


402837 


9369121 


Minus 


403199 


9958183 


Minus 


403329 


851 6120 


Plus 


403593 


6862650 


Minus 


403943 


7711864 


Plus 


404091 


7684554 


Minus 


404347 


9838195 


Plus 


404826 


6572184 


Plus 


405366 


2182280 


Plus 


405654 


4895155 


Minus 



96756-97558 
59112-59228 

90044-90184,91111-91345 

22335-23166 

176341-176452 

116596-118316,119119-119244,119609-119761,120422-120990,130161-130381,130488-130593,131097-131258,131865- 
131932,132451-132575,133580-134011 



73126-73623 
29782-29932 
110326-110491 



100742-100904,101322-101503 
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TABLE 21: 210 GENES UP-REGULATED IN BREAST CANCER 
COMPARED TO NORMAL ADULT TISSUES THAT ARE 
5 LIKELY TO ENCODE EITHER ENZYMES OR PROTEINS 

AMENABLE TO MODULATION BY SMALL MOLECULES 

Table 21 shows 210 genes up-regulated in breast cancer compared to normal adult tissues 
that are likely to encode either enzymes or proteins amenable to modulation by small 

10 molecules. These were selected as for Table 19, except that the ratio was greater than or equal 
to 3.0, the "average" normal adult tissue level was set to the 85 th percentile value amongst 
144 non-malignant tissues, and the 96 th percentile value amongst 73 breast cancers was 
greater than or equal 80 units, and the predicted protein contained a structural domain that is 
indicative of enzymatic function or of being modulatable by small molecules (e.g. pkinase, 

1 5 peptidase, phosphatase, ATPase, or ion_transporter domains). The predicted protein domains 
are noted. 



Pkey: 

UnigenelD: 
Unlgene Title: 
R1: 



Unique Eos probeset identifier number 

Exemplar Accession number, Genbank accession number 

Unigene number 

Unigene gene iitle 

Ratio of 93rd percentile tumor to & percentile of normal body tissue 



Pkey ExAccn UnigenelD Predicted Protein Domains 



40 



449746 
400292 
424735 
407178 
40B045 
450375 
429170 
445730 
424634 
420757 
424399 
447350 



4240D1 
418Q0? 
421727 
411669 



AI668594 Hs, 176538 
AA250737 Hs.72472 
U31875 Hs.272499 
AA195651 Hs.104106 
AW13S959 Hs.245123 
AA009647 Hs.8850 
NM_001394Hs.2359 
AI624342 Hs.170042 
NM_0Q3613Hs.151407 
X78592 Hs.93915 
AI905687 Hs.2533 
AI375572 Hs. 172634 
AA1 93450 

X52509 Hs.161640 

D90041 Hs. 155956 
W678B3 Hs.137476 
M13509 Hs.83169 



UnigeneTitle 

ESTs, WeaMy similar to CP4Y_HUMAN CYTOC 



) X07620 Hs.2258 

3 AW873596 Hs.182278 

3 AI351010 Hs.102267 

5 W72838 Hs.2533 

1 AW732573 Hs.47584 

5 H87679 Hs.102267 

5 NMJ02497Hs,1 53704 

7 R28363 Hs.242B6 



SS,p450 

death ,ZU5JM,Acllvin„recp,pkinase, 
SS,TM short-chain alcohol dehydrogenase family 

SS.Dihydroorotase, ESTs 
Phosphodlest,Somatomedin_B, ESTs 

SS,TM,dislntegrin,Pep_M12Bj3ropep,Reproi a dlsintegrin and metalloprotelnase doma 
DSPc,Rhodanese, dual specificity phosphatase 4 

SS,TM,Cation_efilUx ESTs 

i&tspJ.SS.AAA cartilage intermediate layer protein, nu 

hormone_rec,Androgen_recep,zf-C4, androgen receptor (dihydrotestosterone r 
SS aldehyde dehydrogenase 9 family, member 

pkinase, ESTs 

SS,TM,p45Q,p450 gb:zr40e07.r1 Soares_NhHMPu_S1 Homo sapj 

SS,TM < amlnotrai}_1_2,Cadherin_CJerm,cadhtyrosine aminotransferase 
SS,p450,SS,TM,p450 C1001134:gi|2117372|pirl|l65981 fatty ac 

3S,AcDtyltransf2, N-acetyltransferase 1 (aryiamine N-acety 

pkinase, paternally expressed 10 

SS,hemopexin,Peptidase. M10,SS,Pep(idase_rnatrwinefalloproieinase 1 (MMP1: Infers 

kynurenine 3-monooxy_eriase (kynurenine 3 
SS,Pep1idase_M1, ESTs 

hemopexln,Peptidase_M10,SS,Peptidase„M10matrix metaSoproteinase 10 (MMP10; sir 
SS.DENN calmodulin 2 (phosphorylase kinase, delt 

SS,LysyLox;dase lysy) oxidase 

SS aldehyde dehydrogenase 9 family, member 

TM,KJetra,ion_trans,SS,TM,IOetra,ion_t potassium voltage-gated channel, delayed 
SS,LysyLox!dase,A(dose_epim,Epimerase,S lysyi oxidase 
pkinase,SS,TM,pkinase,polyprenyI_synt, NIMA (never in mitosis gene a)-related k 
SS,TM ( 71mJ,p450,rrm ESTs 

pkinase, gb:IL-BT152-080399404 BT152 Homo sapien 

death,ZU5.TM,Aclivin_reop,pkinase, gb:zt32h03.r1 Soares ovary lumor NbHOT H 
SS.Acyltransferase, KIAA1560 protein 
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423887 ALD802D7 Hs. 134585 
429B59 NM_007050Hs.225952 
426358 BE440042 Hs,33326 
416912 N!^L0006B5Hs.B9472 
451952 AL12D173 Hs.301663 
402408 NA 

415539 AI733BB1 Hs.72472 
406687 M31126 Hs.272620 
419948 A8041035 Hs.93847 
40D285 NA 

408380 AF123050 Hs.44532 
409203 AA780473 Hs.687 
424902 NM_003866Hs.1 53687 
431725 X65724 Hs.2839 
418092 R45154 Hs.106604 
427811 MB1057 Hs.180864 
423554 M90516 Hs.1574 
426261 AW242243 Hs.168670 
413374 NM_001034Hs.75319 
432677 NlvL004482Hs,278611 
45698S D38299 Hs.170917 
453331 AI240665 Hs.8895 
407721 Y12735 Hs.38018 
418004 U37519 Hs.B7539 
410555 U92649 Hs,64311 
443695 AW204099 Hs.337720 
423545 AP000692 Hs.1 29781 
401045 



C11OO1883^giJ6753278jref|NP,033938.1|c 



SS,TM,BRCT,ank > ABC_tan,ABC_tran DKFZP434G232 protein 
SS,Tf,1,Y_pSiosphatase,MAM,fn3, protein tyrosine phosphatase, receptor t 

SS.PeptidaseJ/ilO.hemopexin.SS.Peptldasejnatrix metaitoproielnase 3 (stromelysin 
SS,TM,7tmJ ,SS,TM,7tm_1 , ang lotensin receptor 1 

S5,pftlnsse, ESTs 

SS,carb_anhydrase NM_030920*:Homo sapiens hypothetical pro 

death,ZU5JM,Adiviajecp,pkinase, BMP-R1B 
SS,Peptidase_M10,hemopexin,SS, Peptidase jnatrix metalloproteinase 11 (MMP11; slro 
Ferric„reduct,TM,Ferr!c_redtict, NM_016931:Homo sapiens NADPH oxidase 4 ( 

TM,ABCJran 1 A8C_membrane, Eos Control 

SS 1 TM,ubiqtiiiin,7(m_3,ANF„receptor,sushi diublquiiift 

SS,p450,SS,p450 cytochrome P450, subfamily IVB, polypapt 

SS,SS inositol polyphosphate-4-phosphatase, ty 

SS,Cys_kr,ot,SS Norrie disease (pseudoglioma) 

death.ZUS.pkinase.Activin^recp, ESTs 
SS^n_carbOpept,Propep_M14,SS,Propep_M14carboxypeptldase B1 (tissue) 
GATase_2,SIS, gIutamine-ffuctase-6-phosphate transamin 

SS,TM, W04Q,ubiqultln,E1-E2 j\TPase,Cation peroxisomal famesylated protein 
SS ribonucleotide reductase M2 polypeptide 

SS,TW,Glycos_transf_2,Ricin_BJectin,DPP UDP-N-acetyl-alpha-D-galactosamine:polyp 
SS,TM,7lm_1 , prostaglandin E receptor 3 (subtype EP3) 

SS,TM,dlslnlegrin,Pep_M12B_propep,Reprol ESTs 

pkinase, dual-specificity tyrosine-(Y)-phosphoryl 

SS.aldedh.SS.aWedh, aldehyde dehydrogenase 3 family, member 

TM,disinlegrir,Reprolysin, a dlsMegrin and melaiioprotcinase doma 

ESTs, Weakly similar to AF126780 1 retin 
GAF.PDEase chromosome 21 open reading frame 5 

ICEj)20,SS l ICEj10 r ICE_p20,ICE_p10,ICE_p 



442082 R41823 Hs.7413 
418836 AI65549S Hs.101712 
404091 NA 

450865 AI248013 Hs.10B532 
424085 NMJ02914HS.139226 
449650 AF055575 Hs.23838 
432304 AA932186 Hs.69297 
415669 NM_005025Hs.785B9 
442942 AW167087 Hs.1'315B2 
428795 R45503 Hs.97469 
428479 Y00272 Hs.184572 
400300 X03353 
447359 NM_012093Hs,18268 
402230 NA 

424687 J05070 Hs.15173B 
432328 AI572739 Hs.195471 
432239 X81334 Hs.2936 
400286 NA 

425247 NM_005940Hs,155324 
434737 AA828246 Hs.291884 
439310 AF086120 Hs. 102793 
441111 AI806867 Hs,1 26594 
452355 N54926 Hs.29202 
427711 M31659 Hs.180406 
429353 AL1 17406 Hs.200102 
425325 X52730 Hs,1B92 

phenylelhanolamineN-methyltransferase 
448706 AW291095 Hs.21814 SS.TM.pkinase, 
403593 NA • 

432777 AA564991 Hs.269477 
446232 AI281848 Hs. 194691 
411165 NMJQ0169HS.69089 
414117 W88559 Hs.1787 
411096 U80034 Hs.68583 
450506 NM.00446DHs.418 
417975 AA641836 Hs.30085 
447752 M73700 Hs.105938 
427122 AW057736 Hs.323910 



TM,EPH_tod,pkinase,SAM,fn3, 
TM,Activ)n_recp,pkinase,death,ZU5, 
TM,7tmJ,ANF_recepior, 
zf-C2H2 

SS,AAA,ViraLhelicase1,rrm, 
TM,ion_trans,SS,TM,ion_trans, 
TM,7tm_1, 
SS,serpin, 

SS.ig.Sema.pklnase, 
SS.TM 

SS,pklnase,pfe 



6.7 

ESTs; calsyntenin-2 
ESTs 

Target Exon 

ESTs, Weakly similar to 138588 reverse i 
replication factor C (activator 1) 2 (40 
calcium channel, voltage-dependent, Lty 
ESTs 

serine {or cysteine) proteinase inhibifo 
ESTs 

ESTs, Highly similar to A39789 N-acelyll 
3li division cycle 2, G1 to S and G2 to 



SS,TM,pkinase,Recep„L„domain,SH2 > PH,FuriHER2 receptor tyrosine kinase (c-erb-b2, 
SS.adeny Wninase, adenylate kinase 5 

SS,TM,p450, Target Exon 

SS,Peptidase_M10,fn2,hemopaxIn,SS,TM,Pepmafrix metalloproteinase 9 (geiaiinase B 
6PF2K,PGAM, 6-phosphofructo-2-kinase/fructose-2,6-bi 
SS,Peptidase_M1Q,hemopexin,SS,Pepfidase_matrix metaitoproielnase 13 (collagenase 
SS,TM,ABCJran,ABC_membrane,SS . C16000922:gi|7499103|pir|[T20903 hypottie 
SS,PeptidaseJvi10 ( hemopexin,SS matrix metalloproteinase 11 (MMP11; sfro 

UCH-1,pkinase I OPR,Rhodanese,AMP-binding,ESTs 
SS.TM,UDPGT,caseinJcappa ESTs 
SS.TM.Phosphodiest, ESTs 
TM,7tm_1 ,TM G protein-coupled receptor 34 

SS solute carrier family 25 (mitochondria! 

SS,TM,AEC_tran,ABC_membrane, ATP-blnding cassette transporter MRP8 

SS,NNMT_P^f^T_TEMT,SS,NNMT_PI\lMT TEMT.STAR 



interleukin 20 receptor, alpha 
CIDE-N,pklnase TargetExon 
aipha-amylase, ESTs 
SS,TM,7fmJ3,RibosomaLI-13 retinoic acid induced 3 

SS,Melibiase,BTK,PH,pkinase,SH2 r SH3,Ribo galactosidase, alpha 
TMJortJrartSjKJetra, proteolipid protein 1 (Pellzaeus-Merzbac 

PeptldasaJB, mitochondrial intermediate peptidase 

SS,DPPIV_NJerm,Pepiidase_S9,SS,DPPI V_N ..fibroblast activation protein, alpha 
SS.trypsin hypothetical protein FU23186 

SS,transferrin,7SnLl, transferrin, lactotransferrin 
SS,TM,pkinas3,Recep_Ldomain,SH2 1 PH,FuriHER2 receptor tyrosine kinase (c-erb-b2, 
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20 



AA447453 Hs.27860 
AA687376 Hs.269533 
AA339449 Hs.82285 
AI860021 Hs.270651 
BE614743 Hs.146688 
H26735 Hs.91668 
A1345455 Hs.76915 
A191Q275 Hs.1406 
AA8633S0 Hs.26040 
AW294092 Hs.21594 
NA 



424420 
452190 
419986 
421582 
446733 
453060 
400205 



432690 AH 81490 Hs,278S27 



AL043004 Ks.79337 
R9669S Hs.35598 
NM_013257Hs.279696 
AW073310 Hs.163533 
NM 002916HS.35120 
AI345227 Hs.105448 
AA829286 Hs.332053 
NM_0D1809Hs.1594 
U38945 Hs,1174 



447754 
453775 
431657 



420139 
425071 
424511 
428722 



AL133916 Hs.172572 
Z97630 Hs.226117 
NM_005357Hs.95351 
WM,013989Hs.154424 
BE30Q512 Hs.193557 
U7B456 Hs.190787 
L22524 Hs.2256 
M3115B Hs.77439 
.313803 



444618 
408761 
427809 
418203 
454071 
424676 
457465 
417601 



426310 
417531 
444781 
431493 



425320 
431854 
407104 
449051 
434398 
454042 
407756 
401464 
412970 
412049 



AV653785 Hs.1 73334. 
AA057264 Hs.238936 
M263B0 Hs.180878 
X54942 Hs.83758 
A1041793 Hs.42502 
YOB565 Hs.151678 
AW301344 Hs.122908 
NM 014735Hs.82292 
AU076643 Hs,313 
Y13647 Hs.119597 
AW207175 Hs.1O6771 
NM_000909Hs.169266 
NM_003157Hs,1O87 
NM_014400Hs.11950 
A1791493 Hs.129873 
AF059214 Hs,194687 
AI306B76 Hs.103849 
AW895719 
NMJ03316Hs.2442 
U29344 HS.S3190 
AA383550 Hs.271699 
S5729S Hs,323910 
AW961400 Hs.333526 
AA121098 Hs.3838 
H22570 Hs.172572 
AAf 16021 Hs.38260 
AF039241 Hs.9028 
AB026436 Hs, 177534 
M53437 Hs.18268 
U25128 Hs.159499 



413453 AA129640 Hs.128065 



SS,TM,3BetaJHSD, EMSP00000171555:CDNA FU10727 fis, clone 

SS,TM,7fm_1, Homo sapiens mRNA; cDNA DKFZp586M0723 (f 

SS,pkinase,RhoGEF,lg,PH,SH3, ESTs 

AlRS,formyLtransf 1 GARS,SS,GARS 1 AIRS ( forphosphorifaosylglydnamideformyltransfer 
pkinasB ESTs, Moderately similar to A47582 B-cel 

SSJM.MAPEG, prostaglandin E synthase 

SS,TM,PH,SH2,Furin-llke,pkinase,Recep_L Homo sapiens clone PP1498 unknown mRNA 
pklnase,OPR, GA-bindlng protein transcription factor, 

SS,trefoil,SS,TM,ld|_recepi a,SRCR,tryps trefoil factor 1 (pS2) 
S$,TM,p450, ESTs, Weakly similar to fatly acid omega 

hypothetical protein MGC15754 
NMJ06265*:Homo sapiens RAD21 (S. pombe) 
gb:UI-H-BW0-ajc-c-07-0-Ul.s1 NCI_CGAP_SU 
prenylcysteina lyase 

Homo sapiens clone 23736 mRNA sequence 
SS.pkinase, KIAA01 35 protein 

SSJMtfypsin.vwd.lg ESTs 

pkinase,pkinase_C, serum/glucocorticoid regulated kinase-li 

pkinase, Homo sapiens cDNA FLJ14142 fe, clone MA 

SS,AAA,PI3_Pl4 Jinase,PI3Ka,PI3K_rbd,PI3 replication factor C (activator 1) 4 (37 
SS,TM,pktnase ESTs, Weakly similar to B34087 hypofhefl 

SS.SAAjroteins.ABCjriembraneABCjran, serum amyloid A1 
SSJM.thiolase, centromere protein A (17kD) 

ank.ank cyclMependeni kinase inhibitor 2A (me 

gb:stearoyl-CoA desaturase [human, adipo 
hypothetical protein FU20093 
H1 histone family, member 0 
lipase, hormone-sensitive 
deiodinase, Sodolbyranirie, type II 
ESTs, Moderately similar to ALU7_HUMAN A 
tissue Inhibitor of meialloproteinase 4 
3,PeptidasB_MlO,SS,Peptidase_M10,!iemDp8matrix metalloproteinase 7 (MMP7; uterin 
SS,cNMP_bSnding,Rlla,HMG . box protein kinase, cAMP-dependent, regiiiato 

SS.abhydrofase ESTs, Highly similar to AF157B33 1 none! 

SS,SH2,PH,SS,TM,PH,SH2,Furin-!lke,pkInas growth factor receptor-bound protein 7 
SS.Peptldase C1,gpdh ESTs 

p450,SS,p45O C5O00355;gil4503225|fef|NPJ00765.1| cyt 

ELL-RELATED RNA POLYMERASE II, ELONGATIO 
SS,TM,7tmJ, ESTs, Weakly similar to (defline not ava 

SS,fipase,PLAT,Sec7,PH, lipoprotein lipase 

CKS.SS.CKS, CDC28 protein kinase 2 

TM.rtmJ, ESTs 

Glycos_transf_2,Rlcin_B_lectin,SS t Glycos UDP-N-acetyl-alpha-D-galactosamlne:poiyp 
DNA replication factor 



SS,TM, 

SS,ig,pklnase,LRRNT,LRRCT, 

SS,TM,linker_hi5tone,7fm_1 

SS,TM,p450, 

SS,T4_defod)nase,T4_deiDdinase, 
SS.Yjphosphatase.Band J 1 
SSJIMP, 



>S,TM, 



jdjonjr 



SS 



KIAA0215 gene product 

' id phosphoproieln 1 (osteopontin, 



lS,7tm 1.SPRY, 
;S,TM r 7tm_1, 

;s,pkinase,vwa,vwa,GlycoJransL8 
iS,PH,lactamase_B 



ESTs 

neuropeptide Y receptor Y1 
serine/threonine kinase 2 
GPI-anchored metastasis-associated prote 
ESTs, Weakly similar to A35036 cytochrom 
cholesterol 25-hydroxylase 
i,Peplidase_M10,hemopexin,Pephypothetical protein DKFZp7S1D112 
TM,ionJians,K„tetra, gb:QV4-NN0039-29O300-154-f06 NN0Q39 Homo 

a dislntegrin and metalloproteinase doma 
AcyUransf,gdtuzinc,ketoacy!-synt,pp-bi fatty acid synthase 
IMS.SS polymerase (DNA directed) lota 

SS,TM,SH2,PH,pkinase J Recep_L_domain,Furiv-erb-b2 avian erythroblastic leukemia v 
SS H ER2 receptor tyrosine kinase (c-erb-b2, 

pklnase,POLO_box,SS,pkina5e,POLO_box, serura-induclble kinase 
SSJg.pkinase.LRRNT.LRRCT, hypothetical protein FLJ2O093 

SS.UCH-1 ,UCH-2,SS,TM,G_gluJranspepi ubiquitin specific protease 1 S 
Peptidase_M24, histone deacetylase 5 

DSPcMyosinjail, dual specificity phosphatase 10 

SS.adenylatekinase, adenyiate kinase 5 

SS,TM,7tm_2 l SS,TM,7t]TL2 parathyroid hormone receptor 2 
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407846 AA426202 Hs.40403 


TM,ABC„membrane,ABC_tran - RibosomaLS4eCbp/p300-intefaotinglransactivatar,wit 


3.4 


406925 L34041 Hs.9739 


S3,TM ) tiensporLprot,SWIB,RhoGAP,DAG_PE-glycerot-3-phosphate dehydrogenase 1 (so 


3,4 


445B73 AA25Q970 Hs.251946 


SS/rm,PABP h pkfnase,14-3-3,rrTTl 


poly{A)-binding protein, cytoplasmic 1- 


3.4 


416054 NM_002318Hs.B3354 


SS,TM,mf(o_carriLysyLb)(fdase 


tRNA isopentenyipyropnospnate transteras 


. 3.4 


406815 AA833930 HS.28B036 




3,4 


410530 M25809 Hs.64173 


ATP-synLaOiSS,7trTL.1,ATP-syiiLa" 


ATPase, H transporting, lysosomal (vacuo 


3.4 


407021 US2077 






3.4 


421168 AF1 82277 Hs.330780 


SS,p450,SS 


cytochrome P450, subfamily MB (pnenooar 


3.4 


431473 AA825686 Hs.321176 


S u 


rnr? W f^ k |f s '"f? r !° SS5 ^ reveree ' 


3.4 


408101 AW9685U4 Hs. 123073 


pklnass, 




3.4 


422083 NM_001141Hs.111256 


lipoxygenase.PLAT, 




3.3 


411393 AW797437 Hs.69771 


SS 1 sushl,lrypsln,vwa,(Tm,flbrinogert_.C|fn 


B-factor t properdin 


3.3 


435767 H73505 Hs. 117874 


SS,PeptidasB__SB,P r Peptidase_S8,P 




3.3 


433066 NM 006436Hs,288215 


SS,Pribosyltran, 


sialylt/ansferase 


3.3 


426928 AF037062 Hs.172914 


SS,adh_shorf,TGF-beta,TGFb_propeptide 


retinol dehydrogenase 5 (1 1-cls and 9-ci 


3.3 


414575 H11257 Hs.22968 


SS.pkinase.lg, 


Homo sapiens clone 1MAGE*451939, mRNA se 


3.3 


445941 AI267371 Hs.172636 


SS,SS,lipoxygenase,PLAT 




3.3 


444542 AI161293 Hs.2e0380 


SS J SS ( Fept!dase_M1,EGF l fgi!ecy'n_c l suslir 


aminopeptfdase 


3^3 


425741 AF052152 Hs.159412 


pkinase, 


Homo sapiens clone 24628 mRNA sequence 


3,3 


434228 Z42047 Hs.283976 


SS,TM,7tm_1 


Homo sapiens PR02751 mRNA, complete cds 


3.3 


433264 D85782 Hs.3229 




cysteine dloxygenase, type I 


3.3 


400419 AF084545 


SS,Peptidase_M1, 


Target 


3.3 


439750 AL359053 Hs.57664 


TM,lntegrin_B,Ririn_B_lec!in,rnn 


Homo sapiens mRNA full length insert cDN 


3.3 


417757 R19B97 Hs.106604 


death,2U5,pkinase,Activin_recp J 


ESTs 


3,3 


452194 AI094413' Hs.332649 


SS l TM,7ba.3,ANF_fBC8plor l sushi 


olfactory receptor, family 2, subfamily 


3.2 


421456 NM 003654Hs, 104576 




carbohydrate (kerafan sulfate Gat-6) sui 


3.2 


443767 BE562136 Hs.9736 


SSpCI,RasGEF,horrnona_rec,zf-C4 1 


proteasome (prosome, macropain) 26S subu 


3.2 


42264B D859B3 Hs.118693 


peroxidasaLRRCT, 


Melanoma associated gene 


3.2 


423431 AA326062 


SS,p45Q,p450 


gb:EST29171 Cerebellum II Homo sapiens c 


3.2 


451264 AI768235 


SSJrehalase 


gb:wg82g08jcl Soares_NSF_F8_9W_OT w PA_P_S 


3.2 


452110 T47667 Hs.28005 


SSJM.Activin recp.pkinase 


Homo sapiens cDNA FLJ11309 fis, done PL 


3.2 


439963 AW247529 Hs.6793 


TM,p4S0,Ets 


platelet-activating factor acetylhydrola 


3,2 


453941 U39317 Hs.36820 


SS,DEAD,HRDC,h3licase C, 


Bloom syndrome 


3.1 


405664 L34041 Hs.9739 


SS,TM,tran5port_pro1,SWIB,RhoGAP I DAG_PE- 


giyceroi-3- 


phosphate dehydrogenase 1 (so 


3.1 






453487 R31770 Hs.23540 


TMJtmJ, 


ESTs 


3.1 


420911 U77413 Hs,100293 




(Minted N-acetylglucosamine (GlcNAc) ir 


3.1 


443171 BE281128 Hs.9030 


SS,TM,7tm 1,rrm,SS 


TONDU 


3.1 


452256 AK000933 Hs.28661 


TM,GDI,7tm_1, 


Homo sapiens cDNA FLJ10071 fis, clone HE 


3.1 


432201 AI538613 Hs.298241 


SS,TM,tr/psin,SS,TM,trefoil,trypsin,ifef 


Transmembrane protease, serine 3 


3.1 


419150 T29618 Hs.89640 


TM,pMnase,fn3, 


TEK tyrosine kinase, endothelial (venous 


3.1 


444443 AI1492B6 Hs.55099 


SS 


rab6 GTPase activating protein (GAP and 


3.1 


426283 NM 003937HS.169139 




kynureninase {L-kyrturenlne hydrolase) 


3.1 


436291 BE568452 Hs.5101 


SS,abhydro!ase, 


protein regulator of cytokinesis 1 


3.1 


450223 AA418204 Hs.241493 


SS.proJsomerase, 


natural WIlBf-iumorrecoonltion sequenc 


3.1 


424269 AW137691 Hs.199754 


SS,TR7tm_2,GPS 


ESTs 


3.1 


448105 AW591433 Hs.298241 


SS p TM J trefo1l,Uyp5ia I trefoll 


Transmembrane protease, serine 3 


3.0 


452560 BE077084 Hs.336432 


SS,rrm,rf-RanBP > pklnase,C2, pklnase_C,DAG ESTs 


3.0 



318 



WO 02/059377 



PCT/US02/02242 



TABLE 21A 

Table 21 A shows the accession numbers for those pkeys lacking unigenelD's for Table 21. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset Identifier number 

CAT number: Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT number Accessions 

420854 197072.1 AW296927 ASS64514 AI263169 AA281079 

423431 228162J AA326062 M325758 AW962162 

423945 233566J AA41 0S43 AW948953 AA334202 AA332882 

451264 863988 1 AI768235 R314Q0 K29082 H23107 

455325 1279475J AW895719 N31451 N41451 

456207 155078 -1 ' 
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TABLE 21B 

Table 21B shows the genomic positioning for those pkeys lacking unigene ID's. and 
accession numbers in Table 21. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Pkey: Unique number corresponding to an Eos prabeset 

Ref; Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I. et al." refers to the publication 

entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999} 402:189495. 
Strand: Indicates DNA strand from which exons were predicted. 

NLposition: Indicates nucleotide positions of predicted exons. 

Pkey Ref Strand NLposition 

401045 8117619 Plus 90044-90184,91111-91345 

402230 9966312 Minus 29782-29932 

402408 9796239 Minus 110326-110491 

402578 98B4928 Plus 

403593 3652550 Minus 



100742-100904,101322-101503 
82121-83229 
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TABLE 22: 739 GENES UP-REGULATED IN BREAST CANCER 
COMPARED TO NORMAL ADULT BREAST 

Table 22 shows 739 genes up-regulated in breast cancer compared to normal adult breast. 
These were selected as for Table 19, except that the ratio was greater than or equal to 3.0, the 
denominator was the 85 th percentile value for 12 non-malignant breast specimens, and the 
96 th percentile value amongst the 73 breast cancers was greater than or equal 100 miits. 



Pkey: Unique Eos probeset Identifier number 

ExAccn: Exemplar Accession number, Genbank a 

UnigenelD: Unigene number 

Unlgene Title: Unigene gene title 

R1: Ratio of 9 nih percentile tumor to 85 th percentile normal breast tissue 



Pkey ExAccn UnigenelD UnigencTitle R1 

400292 AA250737 Hs.72472 BMP-R1B 51.5 

424735 (J31875 Hs.272499 short-chain alcohol dehydrogenase family 38.3 

400297 AI1 27076 Hs.334473 hypothetical protein DKFZp56401278 29.9 

431448 AL137517 Hs.334473 hypothetical protein DKFZp5G401278 26.9 

451110 AI95504D Hs.265398 ESTs, Weakly similar !o fransformation-r 25.8 

431 21 1 MBSB49 Hs.323733 gap junction protein, bela 2, 26kD (conn 23.2 

418203 X54S42 Hs.83758 CDC28 protein kinase 2 22.6 
407980 AA046309 gb:zf12f01,s1 SoaresJetaLhearf_NbHH19W 19.8 

414S46 AA353776 Hs,9Q1 CD48 antigen (B-cell membrane protein) 18.9 

445921 AB012113 Hs.16530 small inducible cytokine subfamily A (Cy 18.0 

409041 AB033025 Hs.50D81 KIAA1 199 protein 17.6 

412140 AA219691 Hs.73625 RAB6 Interacting, kinesin-fike (rabkines 17.6 

407824 AA147S84 Hs;9812 Homo sapiens cDNA FLJ143B8 fis, clone HE 17.1 

453160 AI263307 Rs.239884 H2Bhistone family, member L 17,0 

407137 T97307 gb:ye53h05.s1 Scares fetal liver spleen 16.1 

425692 D9D041 Hs, 1 55956 N-acetyltransferase 1 (arylamlne N-acety 16,1 

438533 AI440266 Hs.170673 ESTs, Weakly similar to T24832 hypothetl 16.0 

42B227 AA321649 Hs.2248 small Inducible cytokine subfamily B {CX 15.5 

444342 NM_014398Hs.10837 similar to lysosome-associated membrane 15.1 

422505 AL120862 Hs.124165 programmed ceil death 9 (PDCD9) 14.9 

430515 AA746503 Hs.283313 ESTs 14.7 

417308 H60720 Hs.81892 KIAA0101 gene product 14.4 

452744 AI267652 Hs.30504 Homo sapiens mRNA; cDNA DKF7p434E082 (fr14.4 

412446 AI76B015 Hs.92127 ESTs 14.2 

415539 AI733881 Hs.72472 BMP-R1B 14.1 

435496 AW840171 Hs.265398 ESTs, Weakiy sirriiarto iransfoimat/cn-r 13.8 

43B209 AL120659 Hs.6111 aryl-hydrocarbon receptor nuclear transl 13.8 
40D205 NA NM_006265':Homo sapiens RAD21 (S. pombe)13.5 

4309S5 AA489732 Hs, 154918 ESTs 13.4 

415263 AA948033 Hs.1 30653 ESTs 13.3 

451952 AL120173 Hs.3016S3 ESTs 13.2 

449722 BE280074 Hs.23960 cyclin B1 ' . 13.2 

406685 M18728 gb:Human nonspecific crossreacting antig 13.0 

406690 M29540 Hs,220529 carcinoembryonic antigen-related cell ad 12.8 

429925 NM_000786Hs.226213 cytochrome P450, 51 (lanosterol 14-alpha 12.8 

416498 U33B32 Hs.79351 potassium channel, subfamily K, member 1 12.7 

432378 AI493046 Hs.146133 ESTs 12.5 

441377 BE218239 Hs.202656 ESTs 12.5 
455207 AA193450 g b:zr4Oe07,r1 Soares_NhHMPu_S1 Homosapi12.4 

422805 AA4369B9 Hs.121017 H2A hlstone family, member A 12.2 

407811 AW190902 Hs.40098 cysteine knotsuperfamily 1, BMP antagon 12.2 

407178 AA1 95651 Hs.104106 ESTs 12.2 
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420931 AF044197 Hs.1 00431 small Inducible cytokine B subfamily {Cy 12.1 

421727 Y131S3 Hs.10731B kynurenine 3-morooxygenase (kynurenine 3 12,0 

434408 AI031771 Hs,132586 ESTs 12.0 

446591 H44186 Hs.15455 PDZ dcmaln containing 1 11.9 

4313B5 BE17B536 Hs.11090 membrane-spanning 4-domalns, subfamily A 11.8 

44334B AW873396 Hs.162278 calmodulin 2 (phosphorylase kinase, delt 11.7 

416602 NfvLO0S159Hs.79389 nel (chicken)* 2 11.7 

433355 AF026944 Hs,293797 ESTs 11.6 

437BB6 AA156781 Hs.74170 metailothion'ein 1E {functional) 11.5 

412472 AW975398 Hs.293S36 ESTs 11.4 

41B030 H15261 Hs.21948 ESTs 11.3 

439979 AW600291 Hs.6823 hypothetical protein FLJ10430 11.3 

420757 X78592 Hs.99915 androgen receptor (dihydrotestosterone r 11.3 

411598 BE336654 Hs.70937 H3 histone family, member A 11.2 

423600 AI633559 Hs.310359 ESTs 11.2 

430770 AA765694 Hs.123298 ESTs 11.0 

421037 AI684B08 Hs.197653 programmed cell death 9 (PDCD9) 10.9 

452461 N78223 Hs.1 08106 transcription factor 10.7 

409269 AA576953 Hs.22972 hypothetical protein FU 13352 10.6 

417791 AW965339 Hs.111471 ESTs 10.6 

447258 AI370413 Hs.36563 hypothetical protein FLJ2241 8 1 0,4 

424001 W67833 Hs.137476 paternally expressed 10 10.4 

447342 AI199268 Hs.19322 Homo sapiens, Similar to RIKENcDNA 20 10 10.4 

424905 NM_002497Hs.153704 NIMA (never in mitosis germ a)-relafed k 10.1 
453519 HB7648 Hs.33922 Homo sapiens, clone MGC;9084, mRNA, compl 0.1 

442942 AW1670B7 Hs.131562 ESTs 10.1 

434377 AW137148 Hs.306593 Homo sapiens cDNA FLJ1 1382 fis, clone HE 10,1 

427217 AA399272 Hs.144341 ESTs 10.1 

445730 AI624342 Hs.1 70042 ESTs 10.0 

432887 AI926047 Hs.1 62859 ESTs 10,0 

452243 AL355715 Hs.28555 programmed cell death 9 9.9 

424590 AW966399 Hs.46821 hypothetical proioin FLJ20086 9.9 

432169 Y00971 Hs.2910 phosphoribosyl pyrophosphate synthetase 9.9 

438950 H23789 Hs.144530 EST 9.9 

418836 AI655499 Hs.161712 ESTs 9.8 

430291 AV660345 Hs.238126 CGI-49 protein 9.8 

444665 BE613125 Hs.47783 B aggressive lymphoma gene 9.7 
407377 C 1 6391 gb:C16391 Clontech human aorta polyA mRN 9.7 

445413 AA151342 Hs.12677 CGM47 protein 9.7 

443462 AI064690 Hs.171176 ESTs 9.7 

442145 AI022650 Hs.8117 erbb2-lnteracting protein ERBIN 9.7 

435570 AF212222 Hs.177812 uncharacterized bone marrow protein BM04 9.7 

439B20 AL360204 Hs,283853 Homo sapiens mRNA full length insert cDN 9.6 

42B956 AF059214 Hs,194687 cholesterol 25-hyc'roxylass 9,6 

449448 D60730 Hs.57471 ESTs 9.6 

433929 AI375499 Hs.27379 ESTs 9.5 

432731 R31 178 Hs.287820 fibronedln 1 9.3 

411815 AA155679 Hs.12579D ieucine-rich repaat-confaining 2 9,3 

415385 R17798 Hs.7535 CQBVWike protein 9.3 

422026 U80736 Hs.110826 trinucleotide repeat containing 9 9.2 

432596 AJ224741 Hs.278461 matrilin 3 9.2 

439451 AF086270 Hs.278554 heferochromatin-iike protein 1 9,2 

423945 AA410943 gb:zt32h03.rt Soares ovary iumorNbHOTH 9.1 

442432 BE093589 Hs.38178 hypothetical protein FLJ23458 9,1 

446715 AI337735 Hs.173919 ESTs, Moderately similar to ZN91_HUMANZ 9.0 

408771 AW732573 Hs.47584 potassium voltage-gated channel, delayed 9.0 

437021 A1076089 Hs.292239 ESTs 9.0 

428479 Y00272 Hs, 1 84572 cell division cycle 2, G1 to S and G2 (o 8,9 

428339 AI767755 Hs.62302 Homo sapiens cDNA FLJ14814 fis, clone NT B.9 

402408 NA NM_03Q920*:Homo sapiens hypothetical pro 8.9 

418501 AA279490 Hs.86368 catmegin 8,8 
426327 W03242 Hs.44B98 Homo sapiens clone TCCCTA001 51 mRNA sequ8,8 

419519 AI198719 Hs.176376 ESTs 8.8 

440621 AW29S024 Hs.150434 ESTs 8.8 

446142 AI754593 Hs.145968 ESTs B.8 

418196 A1745549 Hs.26549 KIAA1708 protein 8.7 

447178 AW594641 Hs.192417 ESTs 8.7 
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427565 D31 152 Hs.179729 collagen, typeX, alpha 1 {Schmld metaph 8.6 

415657 AA866115 Hs.127797 Homo sapiens cDNA FLJ113B1 fis, clone HE 8.6 

4350B1 AI651474 Hs,163944 ESTs 6.6 

431374 BE25B532 Hs.251871 CTP synthase 8.4 

417866 AW067903 Hs.82772 collagen, type XI, alpha 1 8.4 

437211 AA382207 Hs.5509 ecotroplc viral integration site 2B 8.3 

437751 AA767373 Hs.35669 ESTs, Moderately similar to ALU1_HUMAN A 8.3 

423887 AL080207 Hs.134585 DKFZP434G232 protein 8.2 

440941 BE268362 Hs.7535 COBW-like protein 8,2 

429859 NM_007050Hs.225952 protein tyrosine phosphatase, receptor t 8.2 

410193 AJ132592 Hs,59757 zinc finger protein 281 8.2 

431725 X65724 Hs.2839 Norrie disease (pseudoglioma) 8.1 

446258 AI283476 Hs.263478 ESTs 8.1 

416747 AW876523 Hs.15929 hypothetical protein FLJ12910 8.1 

434424 AI811202 Hs.325335 Homo sapiens cDNA: FLJ23523 fis, clone L 8.1 

421650 AA781795 Hs.122587 ESTs 8.0 

429534 AW976987 Hs.163327 ESTs, Weakly similar to 2109260AB cell 8.0 

457465 AW301344 Hs.122908 DNA replication factor 8.0 

427961 AW293165 Hs.143134 ESTs 8.0 

436481 AA379597 Hs.5199 HSPC150 proiein similar to Ubiquifin-con 8.0 

418216 AA662240 Hs.283099 AF15q14 protein 8.0 

418250 U29926 Hs.83918 adenosine monophosphate deaminase (isofo 7.9 

400285 MA Eos Control 7.9 

401464 AF039241 Hs.9028 hlstone deacety)ase5 7,9 

407242 M18728 gbiHuman nonspecific crossreacting antig 7.8 

422232 D43945 Hs.1 13274 transcription factor EC 7.8 

454024 AA993527 Hs,2939Q7 hypothetical protein FU23403 7.8 

444542 AI161293 Hs.280330 aminopeptidase 7.8 

436396 A1683487 Hs.152213 wingless-type MMTV integration site fami 7.7 

437204 AL110216 Hs.12285 ESTs, Weakly similar to 155214 salivary 7,6 

408805 H69312 Hs.48269 Vaccinia related kinase 1 7.6 

437207 T27503 Hs.15929 hypothetical protein FU12910 7.6 

442B16 AK001741 Hs.8739 hypothetical protein FIJ10879 7,6 

426283 NMJ)03937Hs.169139 kynnrenlnase {L-kynurenhe hydrolase) 7.5 

424687 J0507Q Hs.151738 matrix metallopro'einase 9 (gelatinase B 7.5 

446315 NM_016293Hs.14770 bridging integrator 2 7,5 

433426 H69125 Hs.133525 ESTs 7.5 

405639 M97711 gb:Human T-ceK receptor (V beta 18,1, J 7.5 

420077 AW512260 Hs.87767 ESTs 7.4 

457332 AA961694 Hs.105187 klnesin protein 9 gene 7,4 

422938 NM_001809Hs.1594 centromere protein A (17kD) 7.4 

447555 A1391662 Hs.160963 Homo sapiens, clone MGC;12318, mRNA, com7.4 

444618 AV653785 Hs.173334 ELL-RELATED RNA POLYMERASE II, ELONGATI07.3 

410361 BE391804 Hs.62661 guanylate binding protein 1, interferon- 7.3 

400268 NA NM_G03292:Homo sapiens translocated prom 7.3 

439509 AF086332 Hs.5B314 ESTs 7.3 

407771 AL1 38272 Hs.62713 ESTs 7.3 

407202 N58172 Hs.109370 ESTs 7.3 

433096 AU078803 Hs.282975 carboxylesterase 2 (intestine, liver) 7.2 

422094 AF1 29535 Hs.272027 F-box only protein 5 ' 7.1 

430832 AI073913 Hs.1006B6 ESTs, Weakly similarto JE0350 Anterior 7.1 

430287 AW182459 Hs.125759 ESTs, Weakly similar to LEU5_HUMAN LELXE7.0 

423739 AA398155 Hs,97600 ESTs 7.0 

448212 AW75858 gb:tcS7d07.x1 NC1_CGAP_CLL1 Homosapieris7.0 

407277 AW170035 Hs.326736 Homo sapiens breast cancer antigen NY-BR 7.0 

454440 BE062906 Hs.28338 KIAA1 546 protein 7.0 

444783 AK001468 Hs.621B0 anillln (Drosophiia Scraps homolog), act 7.0 

421373 AA808229 Hs.1 67771 ESTs 6.9 

431960 AW241821 Hs.301927 C6.1A 6.9 

424704 AI263293 Hs. 152095 cytochrome P450, subfamily IIJ [arachido 6.8 

449517 AW500106 Hs.23643 serine/threonine protein kinase MASK 6.3 

439S4D AW449211 Hs.1 05445 GDNF family receptor alpha 1 6.8 

414080 AA135257 Hs,47783 B aggressive lymphoma gene 6.8 

441243 AI76705S Hs,193002 ESTs 6,7 

408380 AF1 23050 Hs.44532 diublquitin 6.7 

422956 BE545072 Hs.122579 hypothetical protein FLJ10451 6.7 

446651 AA393907 Hs.97179 ESTs 6.7 

323 



WO 02/059377 



PCT/US02/02242 



419839 U24577 Hs.93304 phosphoilpase A2, group VII (plalelet-ac 6.7 

437740 AA810265 Hs.122915 ESTs 6.7 

421582 AE91027-S Hs.1406 f refoii factor 1 6.7 

427356 AW023482 Hs.97849 ESTs 6.6 

5 429597 NMJ)03818Hs.2442 a disiniegrin and melalioproteinase doma 6.6 

422634 NM„016010Hs.1 18821 CGI-62 protein 6.6 

421072 A1215069 Hs.89113 ESTs 6.5 

427718 AS798680 Hs.25933 ESTs 6.5 

411000 N40449 Hs.201619 ESTs, Weakly similar to S383B3 SEB43 pro 6.5 

10 449343 AI151418 Hs.272458 protein phosphatase 3 (formerly 2B), cat 6.4 

409757 NM.001898Hs.1231 14 cystatinSN 6.4 

447164 AF026941 Hs.17518 Homo sapiens cig5 mRNA, partial sequence 6.4 

453938 X52509 Hs.161640 tyrosine aminotransferase 6.4 

418848 AI820961 Hs.193465 ESTs 6.4 

15 424902 NMJ03866HS.153667 Inositol polyphosphate-4-phosplialase, ty 6.4 

452838 U65011 Hs.30743 preferential expressed antigen in mela 6.4 

439452 AA918317 Hs.57987 B-cell CLUymphoma 118 (zinc finger pro 6.4 

407266 AJ235664 gb:Homo sapiens mRNA for immunoglobulin 6.3 

411078 AI222020 Hs.182364 CocOaCrisp 6.3 

20 433001 AF217513 Hs.2799D5 clone HQ031QPRO0310p1 6.3 

434340 AH 93043 Hs.1286B5 ESTs, Weakly similar to T17226 hypotheti ,6.2 

429503 AA3941B3 Hs.26873 ESTs 6.2 

402578 C 1 001 1 34:gi[21 1 7372|plr| |I6598 1 fatty ac 6.2 

409646 AW161391 Hs.709 deoxycytidins kinase 6.1 

25 430447 W17064 Hs.332848 SWI/SNF related, matrix associated, acii 6.1 

432415 T16971 Hs.289014 ESTs, Weakly similar to A43932 mucin 2 p 6.1 

443709 AI062692 Hs.134662 ESTs 6.1 

420929 AI694143 Hs.296251 programmed cell death 4 6.1 

428368 BE440042 Hs.83326 matrix melalioproteinase 3 (stromelysln 6,1 

30 428248 AI126772 Hs.40479 ESTs 6.0 

420344 BE4S3721 Hs.97101 putative G protein-coupled receptor 6.0 

453392 U23752 Hs.32964 SRY (sex determining region Y)-box 11 6.0 

425397 J040BB Hs. 1 56346 topolsomerase (DNA) II alpha (170kD) 6.0 

418007 M13509 Hs.83169 matrix metallaproleinase 1 (MMP1; infers 6.0 

3 5 428585 AB007863 Hs.185140 KIAA0403 protefn 6.0 

43760B AA761605 Hs.292308 ESTs, Weakly similar to ALU1_HUMAN ALU S6.0 

427408 AA583206 Hs,215S RAR-reiafed orphan receptor A 6.0 

406687 M31 125 Hs.272620 matrix metalloproieinase 11 (MMP11;stro 6,0 

418092 R45154 Hs.106604 ESTs 6,0 

40 447051 AW139130 Hs.160951 ESTs, Weakly similar to Co:t1 [11. sapiens] 6.0 

441233 AA972965 Hs.135568 ESTs 6.0 

432239 X81334 Hs.2936 matrix metalloproteinase 13 (collagenase 6.0 

435106 AA100847 Hs.193380 ESTs, Highly similar to AF1 74600 1 F-box 5.9 

435525 AI831297 Hs. 123310 ESTs 5.9 

45 458809 AW972512 Hs.20985 sin3-assoclated polypeptide, 30kD 5.9 

410785 AW803341 gb;IL2-UM0079-O903Q0-05O-DQ3 UM0079 Homo5.9 

422576 BE548555 Hs.118554 CGI-83 protein 5.9 

451398 A1793124 Hs.144479 ESTs 5.9 

441881 AW96B904 Hs. 179566 hypothetical protein FLJ22624 5.8 

5 0 412022 AI005043 Hs.24143 Wiskott-Aldrich syndrome protein Interac 5.8 

416636 N32536 Hs.42645 solute carrier family 16 (monocarboxy)ic 5.8 

447350 AI375572 Hs. 172634 ESTs 5.8 

434094 AA305599 Hs.238205 hypothetical protein PRO2013 5.8 

409151 AA303105 Hs.50785 SEC22, vcsiclo trafficking protein (S. c 5.8 

5 5 448807 AI571940 Hs.7549 ESTs 5.8 

452281 T93500 Hs.28792 Homo sapiens cDNAFLH 1041 fis, clone PL 5.8 

421281 A1299139 Hs.17517 ESTs 5.8 

430361 A1033965 Hs.239926 sterol-C4-methvl oxldase-like 5.8 

400289 X07820 Hs.2258 matrix metalloproteinase 10 (MMP10;str 5.7 

60 440527 AV657117 Hs.184164 ESTs, Moderately similar to S65657 alpha 5.7 

434674 AA831879 Hs.136985 ESTs 5.7 

426320 W47595 Hs.1 69300 transforming growth factor, beta 2 5.7 

452401 NMJ!07115Hs.29352 tumor necrosis factor, aipha-lnduoed pro 5.7 

448663 BE614599 Hs.108823 hypothetical protein MGC14797 5.7 

65 438199 AW016531 Hs.122147 ESTs 5.7 

446203 Z47553 Hs.14286 flavin containing monooxygenase 5 5.7 

428336 AA503115 Hs.163752 microsemrnoproteln, beta- 5.6 
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430379 AF134149 Hs,240395 potassium channel, subfamily K, member 6 5,6 

422835 BE218705 Hs.121378 mefalloihionein-like 5, testis-specific 5.6 

444756 AL044878 Hs.11899 3-hydroxy-3-melhy!gtutaryi-Coenzyme A re 5.6 

443426 AF098158 Hs.9329 chromosome 20 open reading frame 1 5.6 

5 400301 X03635 Hs.1657 estrogen receptor 1 5.6 

447078 AW8B5727 Hs.301570 ESTs 5.6 

432015 AL157504 Hs.159115 Homo sapiens mRNA; cDNA DKFZp5860G724 (f5.5 

438691 AA908288 Hs.212184 ESTs 5,5 

439B09 R41395 Hs.101774 hypothetical protein FU23045 5.5 

10 415786 AW419196 Hs.257924 hypoUietical prolein FLJ137B2 5.5 

456373 BE247706 Hs.89751 membrane-spanning 4-domains, subfamily A 5.5 
401645 NA C18fl01440*:gl|12330704|gb]AAG52890.1)AF35.5 

437967 BE277414 Hs,5947 mel transforming oncogene (derived from 5.5 

445885 AI734009 Hs. 127699 KIAA1603 protein 5.4 

15 439138 AI742605 Hs. 193696 ESTs 5.4 

440270 NM_015986Hs.7120 cytokfne receptor-like molecule 9 5.4 

437536 X91221 Hs.144465 ESTs 5.4 

438167 R28363 Hs.24286 ESTs 5.4 

452741 BE392914 Hs.30503 Homo sapiens cDNAFLJH 344 fis, clone PL 5.4 

20 426214 H59846 Hs.128355 ESTs, Moderately similar to ALU7_HUMAN A 5.4 

413554 AA319146 Hs.75426 secretogranin II (chromogranin C) 5.4 

422BB7 L32137 Hs.1584 cartilage oligomeric matrix prolein (COM 5.4 

434263 N34895 Hs.44648 ESTs 5.4 

446382 AW20516B Hs.150B23 ESTs 5.4 

25 422406 AF025441 Hs.116206 Opa-lnteracfing protein 5 5.3 

438321 AA576635 Hs.6153 CGI-48 protein 5.3 

418310 AA8141D0 Hs.86693 ESTs 5.3 



419625 U91616 Hs.91640 nuclear factor of kappa light polypepttd 5,3 

450701 H39960 Hs.288467 Homo sapiens cDN'AFU122B0 fis, clone MA 5.3 

3 0 445900 AF070526 Hs.1 3429 Homo sapiens clone 24787 mRNA sequence 5.2 

449051 AW961400 Hs.333526 HER2 receptor tyrosine kinase (c-erb-b2, 5.2 



418758 AW959311 Hs.172012 hypothetical profein DKFZp434J037 5.2 

431070 AW40B164 Hs.249184 transcription factor 19 (SC1) 5.2 

417079 U65590 Hs.81134 interleukin 1 receptor antagonist 5.2 

3 5 421928 AF013758 Hs.109643 poiyadenylate binding protein-intera* 5.2 

428804 AK000713 Hs.f93736 hypothetical protein FU20706 5.2 

427427 AF077345 Hs. 177936 ESTs 5.2 

403485 C3001813*:gi|12737279|ref|XPJ)12163.1!k 5.2 

422168 AA5S6894 Hs,112408 S100 calcium-binding protein A7(psarias 5.1 

40 421937 ASB78857 Hs.109706 hematological and neurological expressed 5.1 

428752 X69490 Hs.172004 titln 5.1 

453310 X70697 Hs.553 solute carrier family 6 (neurolransmitte 5.1 

423198 M81933 Hs.1634 cell division cycle 25A 5.1 

412281 AI810054 Hs.14119 ESTs 5.1 
45 447513 AW955776 Hs.313500 ESTs, Moderately similar to ALU7_HUMAN A 5.1 

453931 AL121278 Hs.25144 ESTs 5.1 

404347 Target Exon 5.1 

431 80B M3O703 Hs.270333 amphlregulin (schwannoma-derived growth 5.1 

429113 D28235 Hs.196384 prostaglandin-endoperoxide synthase 2 (p 5.1 

5 0 436291 BE568452 Hs.5101 protein regulator of cytokinesis 1 5.1 

450603 R43646 Hs.12422 ESTs 5.1 

434725 AK0D0796 Hs.4104 hypothetical protein 5,0 ' 

435981 H74319 Hs.188620 ESTs 5.0 

407376 AA993138 Hs.142287 ESTs, Weakly similar to ALUF HUMAN Nil 5.0 

55 431689 AA305688 Hs.267695 UDP-Gal:betaGlcNAcbeta 1,3-galactosyltr 5.0 

405348 NA C7001664:gl|12698061|dbJ[BAB21849.1j(AB 5.0 

436196 AK0010B4 Hs.333498 Homo sapiens cDNA FLJ10222 fis, clone HE 5.0 

437065 ALO36450 Hs.103238 ESTs 5.0 

410196 AI936442 Hs.59B38 hypothetical protein FLJ10808 5.0 

60 429412 NM_00B235Hs.2407 POU domain, class 2, associating factor 5.0 

446619 AU076643 Hs.313 secreted phosphoprotein 1 (osteopontin, 4.9 

403329 NA Target Exon 4.9 

442875 BE623003 Hs.23625 Homo sapiens clone TCCCTA00142 mRNA sequ4.9 

442441 AI820662 Hs.129598 ESTs 4.9 

65 430375 AW371 048 Hs.93758 H4 histone family, member H 4.9 



424128 AW966163 gb:EST378236 MA3E resequences, MAGI Homo4.9 

408873 AL04S01 7 Hs.1 82278 calmodulin 2 (phosphorylasa kinase, delt 4.9 
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407910 AA650274 Hs.41295 fibronectin leucine rich transmembrane p 4.9 

432606 NM_002104Hs.3066 granzyme K (serine protease, granzyme 3; 4,9 

453204 R10799 Hs.19199Q ESTs 4.6 
452020 AA722012 Hs,255757 ESTs, Weakly similar to AT2A_HUMAN POTEN4.8 

5 449048 245051 Hs.22920 simllarto S6S401 (cattle) glucose Indue 4.6 

408369 R38438 Hs.182575 solute carrier family 15 (H??? transport 4.8 

431645 AF078S49 Hs.266483 dynein light chain-A 4.8 

423575 C18363 Hs.163443 Homo sapiens cDNA FLJ1 1576 fis, clone HE 4.6 

444246 H932B1 Hs.1Q710 hypothetical protein FLJ20417 4.8 

10 421524 AA312082 Hs.105445 GDNF family receptor alpha 1 4.8 

452827 AI571835 Hs.55468 ESTs 4.6 

414222 AL135173 Hs.878 sorbitol dehydrogenase 4.8 

456086 AL161999 Hs.77324 eukaiyotic translation termination facto 4:8 

419078 M93119 Hs.89584 insulinoma-associated 1 4.8 

15 418973 AA233056 Ks. 191518 ESTs 4.8 

447033 AI357412 Hs.1 57601 ESTs 4.8 

451621 AI879148 Hs.26770 fatty acid binding protein 7, brain 4.7 

41996S X04430 Hs.93913 interieukin 5 (Interferon, beta 2) 4.7 

424326 NMJ14479HS.145296 dislntegrin protease 4.7 

20 4315B5 BE242803 Hs.262823 hypothetical protein FU10326 4.7 

429294 AA095971 Hs.198793 Homo sapiens cDNA: FLJ22483 fis, clone H 4.7 

416814 AW192307 Hs,80042 dolichyl-P-Glc:Man9G!cNAc2-PP-doiichylgl 4.7 

439897 NM.01531DHs.6763 KIAA0942 protein 4.7 

429687 A1675749 Hs.211608 nucleoporin 153kD 4.7 

25 422880 AF2237Q4 Hs.121524 glutathione reductase 4.7 
405801 NM.000390:Homo sapiens choroideremia (Ra4.6 

432435 BE218886 Hs.282070 ESTs 4,6 

439544 W26354 .Hs.23891 hypotheiical protein FU11360;ariemisp 4.6 

425354 U62027 Hs.155935 complement component 3a receptor! 4.6 

3 0 436027 A1864053 Hs.39972 ESTs, Weakly similar to I38588 reverse t 4.6 

424623 AW963062 Hs.337404 ESTs 4.6 

403366 NA Target Exon 4.6 

402542 Target Exon 4.6 

450193 AI916071 Hs.15607 Homo sapiens Fanconi anemia complementat 4.6 

35 411678 AI907114 Hs.71465 squaiene epoxidase 4.6 

456844 AI264155 Hs.152981 CDP-diacylglycerol synthase (phosphatide 4,6 

448072 AI459306 Hs.249D8 ESTs 4,5 

408045 AW138959 Hs.245123 ESTs 4.5 

423782 AI472209 Hs.323117 ESTs 4.5 

40 447388 AW630534 Hs,76277 Homo sapiens, clone MGC:S381, mRNA, comp4.5 

448140 AF146761 Hs.20450 BCM-like membrane protein precursor 4.5 

452561 AI692181 Hs.49169 KIAA1 634 protein 4.5 
425331 AW962128 gb;EST374201 MAGE resequences, MAGG Homo4.5 

428801 AW277121 Hs.254881 ESTs 4.5 

45 428500 AI815395 Hs.184641 fatty acid desaturasB 2 4.5 

426075 AW513691 Hs.270149 ESTs, Weakly similar to 2109260AB cell 4.4 

437259 AI377755 Hs.120695 ESTs 4.4 

400409 AF153341 Hs.283954 Homo sapiens winged helWforkhead trans 4.4 

412863 AA121673 Hs.59757 zinc finger protein 281 4.4 

50 426989 AI815206 Hs.99395 ESTs 4.4 

401866 Target Exon 4.4 

4.18819 AA228776 Hs.191721 ESTs 4.4 

406348 Target Exon 4,4 

412138 AW895387 gb:QV4-NN0038-30Q300-157-c10NN0038Homo4.4 

55 42B550 AW297880 Hs,98661 ESTs 4.4 
411743 AW862214 gb:QV4-CT0361-301299-074-b05 CTQ361 Homo4.4 

429936 BE081342 Hs.283037 HSPC039 protein 4.4 

423291 NMJ)Q4129Hs.126590 guanylate cyclase 1, soluble, beta 2 4.4 

423456 AL11D151 Hs.128797 DKFZP586D0824 protein 4,4 

60 452190 H26735 Hs,91668 Homo sapiens clone PP1498 unknown mRMA4,4 

424671 NM_004525Hs.153595 low density llpoprotein-relafed protein 4.3 

42957S AA706003 Hs.99387 ESTs 4.3 

429922 Z97630 Hs.2261 17 H1 histane fatmy, member 0. 4.3 

421379 Y15221 Hs.1039B2 small inducible cytokine subfamily B (Cy 4.3 

65 400300 X03363 HER2 receptor tyrosine kinase (c-erb-b2, 4.3 

437258 AL041243 Hs.174104 ESTs 4.3 

446595 T57448 Hs. 15467 hypothetical protein FLJ20725 4.3 
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403011 NA ENSP00000215330*:Probableserine/threoni 4.3 

419055 AI3653B4 Hs.11571 Homo sapiens cDNA FLJ 1 1570 fis, clone HE 4.3 

41B661 NM_001949Hs.1189 E2F transcription factor 3 4.3 

407786 AA6S753B Hs.38972 tetraspan 1 4.3 

5 429183 AB014604 Hs.197955 KIAA0704 protein 4.3 

442914 AW188551 Hs.99519 hypothetical protein FLJ 14007 . 4,3 

441029 AI091795 Hs.179246 ESTs 4.3 

452194 AI694413 Hs.332649 olfactory receptor, family 2, subfamily 4.3 

414821 M63635 Hs.77424 Fc fragment of IgG, high affinily la, re 4,2 

1 0 410102 AW24850B Hs.279727 Homo sapiens cDNA FLJ14035 lis, clone HE 4.2 

4521 10 T47667 Hs.28005 Homo sapiens cDNA FLJ11309 lis, clone PL 4,2 

442007 AA301116 Hs.142838 nucleolar phosphoprotein Nopp34 4.2 

417318 AW953937 Hs.12891 ESTs 4.2 

431818 AW510444 Hs.191705 ESTs, Weakly similar to T47184 hypolhetl 4.2 

15 443646 AI085198 Hs.164226 ESTs 4,2 

419169 AW851980 Hs.262346 ESTs, Weakly similar to S7Z482 hypotheti 4,2 

446839 BE091926 Hs,16244 mitotic spindle coiisc'-coll related prot 4.2 

423242 AL039402 Hs. 125783 DEME-6 protein 4.2 

432116 AA902953 Hs.308538 ESTs 4.2 

20 40903B T97490 Hs,50Q02 small inducible cytokine subfamily A {Cy 4.2 

445B25 BE24S743 Hs.288529 hypothetical protein FLJ22635 4.2 

425139 AW630488 Hs.325820 protease, serine, 23 4.2 

447397 BE247676 Hs.1S442 E-1 enzyme 4.2 

410166 AK001376 Hs.59346 hypothetical protein FLJ10514 4.1 

25 437295 AW779318 Hs.88417 ESTs 4.1 

430486 BE062109 Hs.241551 chloride channel, calcium activated, fam 4.1 

441790 AW294909 Hs.1 32208 ESTs 4.1 

410129 BE244074 Hs.58831 regulator of Fas-Induced apoptosls 4.1 

427521 AW973352 Hs.290585 ESTs 4.1 

3 0 425247 NM.005940H5.155324 matrix metalloproteinase 1 1 (MMP1 1 i stro 4.1 

4128B6 AF041163 Hs.74647 Human T-cell receptor active alpha-chain 4.1 
441153 BE562826 gb:6Q1336534F1 NIH_MGC_44 Homo sapiens c4.1 

444301 AK000136 Hs.10760 asporin (LRR class 1) 4.1 

426711 AA383471 Hs.180669 conserved gane amplified in osteosarcoma 4.1 

35 405850 NA Target Exon 4.1 

44D283 AI732892 Hs.190489 ESTs 4.0 

432441 AW292425 Hs.1 63484 ESTs 4,0 

400264 NA estrogen receptor 1 4.0 

417341 N91453 Hs.102987 ESTs 4.0 

40 429732 U20156 Hs.2488 lymphocyte cytosolic protein 2 (SH2 doma 4,0 

411393 AW797437 Hs.69771 B-factor, properdin 4.0 

425704 U79293 Hs.159264 Human clone 2394B mRNA sequence 4.0 

419594 M013051 Hs.91417 topoisomerase (DNA) II binding protein 4,0 

419092 J05581 Hs.89603 mucin 1, transmembrane 4,0 

45 443147 AI034351 Hs.19030 ESTs 4,0 

40B633 AW963372 Hs.46677 PRO2000 protein 4,0 

433404 T32982 Hs.102720 ESTs 4.0 

421506 BE302796 Hs.105097 thymidine kinase 1, soluble 4.0 

417900 BE250127 Hs.82906 CDC20 (cell division cycle 20, S. cerevi 3.9 

50 414602 AW630088 Hs.76550 Homo sapiens mRNA; cDNA DKFZp564B1264 (f3.9 

413762 AW411479 Hs.848 FK506-b(ndlag protein 4 (59kD) 3.9 

4045B0 NMJ1411 2*:Homo sapiens trichorhinophala 3.9 

452046 AB018345 Hs.27657 KIAA0802 protein 3.9 
4595B7 AA031956 gb:zk15o04.s1 Scares j>regnanLutcrus_NbH 3.9 

5 5 416658 U03272 Hs.79432 fibrillin 2 (congenital contraclural ara 3.9 

426647 AA243464 Hs.294101 pre-B-cell leukemia transcription factor 3.9 

429353 AL1 17406 Hs.200102 ATP-blndlng cassette transporter MRPB 3.9 

41903B AW134924 Hs.190325 ESTs 3.9 

416918 X07871 Hs.89476 CD2 antigen (p50), sheep red blood cell 3.9 

6 0 421977 W941 97 Hs.1 10165 ribosomal protein L26 homolog 3.9 

442567 AI2011B3 Hs.130251 ESTs 3.9 

421168 AF182277 Hs.330780 cytochrome P450, subfamily II B(phenobar 3.9 
431 701 AW935490 Hs. 14653 Human chromosome 5q 13. 1 done 5G8 mRNA3.9 

418526 BE019020 Hs.85B38 solute carrier family 16 (monocarboxylic 3.9 

65 414998 NMJ02543HS.77729 oxidised low density lipoprotein (lectin 3.9 

422790 AA809875 Hs.25933 ESTs 3.9 

419741 NM_007019Hs.93OO2 ubiquitin carrier protein E2-C 3.9 
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430017 AA263172 Hs.35 protein tyrosine phosphatase, non-recept 3.9 

458814 AI49B957 Hs.170861 ESTs, Weakly similar to2195_HUMAN ZINC 3,8 

428514 AW236361 Hs.193139 ESTs 3.8 

434521 NM_002267Hs,3886 karyopherin alpha 3 {importln alpha 4) 3.8 

5 409425 U40462 Hs,54452 zinc finger protein, subfamily 1A, 1 (Ik 3.8 

439560 BE565647 Hs.74899 hypothetical protein FLJ12820 3.8 

424028 AF055084 Hs.153692 Homo sapiens cDNA FU14354 fis, clone Y7 3.8 

400021 AFFX control - HUMISGF3A/M97935_MA 3.8 

453403 BE466639 Hs.61779 Homo sapiens cDNA FLJ13591 fis, clone PL 3.8 

10 445941 AI267371 Hs.172636 ESTs 3.8 

434378 AA631739 Hs.335440 EST 3.8 

429220 AW207206 Hs.136319 ESTs 3.8 

439176 AI446444 Hs.190394 ESTs, Weakly similar to B28096 line-1 pr 3.8 

401045 C110Q1883":gi|6753276|ref|NP_033938.1|c 3.8 

15 430178 AW44961 2 Hs.1 52475 ESTs 3.8 

423397 NfvL001638Hs.1652 chemoxine (C-C motif) receptor 7 3.8 

447630 AS660149 Hs.44S65 lymphoid anhancer-biriding factor 1 3.B 

436391 AJ227892 Hs.146274 ESTs 3.8 

413011 AW068115 Hs.B21 blglycan 3.3 

20 422121 AI767949 Hs.179S33 ESTs 3,8 

452268 NM_003512Hs.28777 H2A histone family, member L 3.8 

427811 M31057 Hs,1 80884 carboxypeptidase B1 (tissue) 3,8 

415579 AA165232 Ms.222069 ESTs 3.8 

437330 AL353944 Hs.50115 Homo sapiens mRNA;cDNADKFZp761J11 12 (f 3.8 

25 427122 AW05773S Hs.323910 HER2 receptor tyrosine kinase (c-erb-b2, 3.7 

400286 NA C16000922:glj7499103|pir||T20903 hypothe 3.7 

420281 AI623693 Hs.191533 ESTs 3.7 

419926 AW900992 Hs.93796 DKFZP586D2223 protein 3.7 

417541 AI992191 Hs. 180040 hypotheOcal protein FLJ22439 3.7 

30 426172 AA371307 Hs.125056 ESTs 3.7 

42S636 AI916662 Hs.2 1 1577 kinsctfn 1 (kinesin receptor) 3,7 

457001 J03258 Hs.2062 vitamin D (1,25- dihydroxyvitamin D3) re 3.7 

424109 AW406878 gb:UI-HF-BL0-adg-g-Q6-0-U1.r1 NIH_MGC_373.7 

417022 NM_014737Hs,80905 Ras association (RaiGDS/AF-6) domain fam 3.7 

3 5 436222 A1208737 Hs.122810 Homosapiensct)NAFLJ11489fis,cloneHE 3.7 

430448 AI633553 Hs.13303 Homo sapiens cDNA: FLJ21784 fis, clone H 3.7 

432729 AK000292 Hs.278732 hypothetical protein FLJ20285 3.7 

413916 N49813 Hs.75615 apolipopratain C-ll 3.7 

421662 NMJ)14141Hs.106552 cell recognition molecule Caspr2 3.7 

40 441633 AW958544 Hs.1 12242 normal mucosa of esophagus specific 1 3.7 

408761 AA057264 Hs.238936 ESTs, Weakly similar to (defline not ava 3.7 

406153 Target Exon 3.7 

445563 AW873S06 Hs. 149006 ESTs 3.7 

453464 AIB84911 Hs.32989 receptor (calcitonin) activity modifying 3.7 

45 448918 AB011152 Hs.22572 KIAA05BO protein 3.7 

413936 AF1 13676 Hs.297681 serine (or cysteine) proteinase inhiblto 3.5 

448069 U75248 Hs.Z0191 seven In absentia (Drosophila) homolcg 2 3.5 

453313 BE005771 Hs.153746 hypothetical protein FLJ22480 3.6 

425234 AW152225 Hs.165909 ESTs, Weakly similar to I38Q22 hypotheti 3.S 

5 0 419941 X98654 Hs.93837 phosphatidylinositol transfer protein, m 3.6 

402397 AF188625 Hs.189507 phospholipase A2, group IID 3.6 

430378 Z29572 Hs.2556 tumor necrosis factor receptor superfami 3.S 

448106 AI800470 Hs.171941 ESTs 3.6 

426431 NM_000458Hs. 169353 transcription fac!or2, hepatic; LF-B3; 3.8 

55 431843 AA516420 Hs.183526 ESTs, Weakly similar to I38022 hypotheti 3.6 

426878 BE069341 gb:QV3-BTO381-270100-O73-cO8BT0381 Homo3.6 

434061 AW024973 Hs.283675 NPD009 protein 3.6 

445292 AV653264 Hs.13982 Homo sapiens cDNA FLJ14666 fis, clone NT 3.6 

452101 T60298 Hs.10844 Homo sapiens cDNA FLJ14476 fis, clone MA 3.6 

60 427581 NM_014786Hs.179703 KIAAQ129 gene product 3.6 

409047 AW961434 Hs.31539 ESTs . 3.6 

416820 NM_000402Hs.a02a6 glucose-6-phosphaie dehydrogenase 3.6 

410386 W26187 Hs.3327 Homo sapiens cDMA: FLJ22219 fis, clone H 3.6 

440518 S42303 Ha.161 cadherin 2, type 1,N-cadherin {neuronal 3.6 

65 434360 AW015415 Hs.127780 ESTs 3.6 

428970 BE276891 Hs.194691 retinoic add induced 3 3.6 

415079 R43179 Hs.22895 hypothetical protein FLJ23548 3.6 

328 



WO 02/059377 



PCT/US02/02242 



409619 AK001015 Ks.552Z0 BCL2-associated athanogene 2 3.6 

430044 AA464510 Hs.152812 ESTs 3.6 

430829 AW451999 Hs.194024 ESTs 3.6 

434224 AA360731 Hs.84 interleukin 2 rector, gamma (severe co 3.6 

5 439247 AF088D20 Hs.46767 EST 3.6 

431542 H63010 Hs.5740 ESTs 3.5 

430713 AA351647 Hs.2642 eukaryotlc translation elongation factor 3.5 

434988 AM18055 Hs.161160 ESTs 3.5 

437748 AF234882 Hs.5814 suppression of tumorigenicity 7 3.5 

10 418322 AA2E416S Hs.84113 cydln-dependent kinase inhibitor 3 (CDK 3.5 

439569 AW602166 Hs.222399 CEGP1 protein 3.5 

459583 AI907673 gb:IL-BT152-OB0399-004BT152Homosapien3.5 

40321 2 NM 01 9595;Homo sapiens interseclin 2 (IT 3.5 

409099 AK000725 Hs.50579 hypothetical protein FLJ20718 3.5 

15 ' 453968 AA847843 Hs.62711 Homo sapiens, clone IMAGE:3351 285, mRNA3.5 

43633B W92147 Hs.1 18394 ESTs . 3.5 

422890 Z43784 Hs.75893 ankyrin3,nodeofRanvler(ankyrinG) 3.5 

442295 AI82724B Hs.224398 Homo sapiens cDNA FLJ1 1469 fis, clone HE 3.5 

417975 AA641636 Hs.30085 hypothetical protein FLJ23186 3.5 

20 433730 AK002135 Hs.3542 hypothetical protein FU1 1273 ■ 3.5 

439926 AW014875 Hs.137Q07 ESTs 3.5 

445873 AA250970 Hs.251946 poly(A)-blnding protein, cytoplasmic 1-1 3.4 

410153 BE311926 Hs.15830 hypothetical protein FLJ 12691 3.4 

422128 AW881145 gb:QVO-OTOQ33-010400-182-a07 OT0033Homo 3.4 

25 414921 BE390551 Hs.77628 steroidogenic acute regulatory protein r 3.4 

441134 W29092 Hs.7678 cellular retlnoic acid-binding protein 1 3.4 

444564 AI167877 Hs.14371B ESTs 3.4 

402470 NA Target Exon 3.4 

418120 AA213437 Hs.192249 ESTs 3.4 

30 422414 AW675237 Hs.13701 ESTs 3.4 

433345 AI681545 Hs.152982 hypothetical protein FLJ13117 3.4 

409213 U61412 Hs.51133 PTK6 protein tyrosine kinase 6 3.4 

422611 AA158177 Hs.116722 fucosyltransferase8(alpha(l,6)fucosy 3.4 

423554 M90516 Hs.1674 glutamine-fmcrtose-5-phosphatetransamin 3.4 

3 5 442619 AA447492 Hs.20183 ESTs, Weakly similar to AF164793 1 prate 3.4 

402359 MA C19001991*;g(i12655111|gb|AAK00751.1|AF23.4 

439393 AA284287 Hs.221504 ESTs 3.4 

415208 F01020 Hs.172004 tiiin 3.4 

452853 AA812633 Hs.10845 ESTs 3.4 

40 429345 R11141 Hs.199595 hypothetical protein 3.4 

449027 AJ271216 Hs.22880 dipeptidylpeptidase III 3.4 

412115 AK001763 Hs.73239 hypothetical protein FLJ1O901 3.4 

432180 Y16416 Hs.272822 RuvB (E coli homo!og)-like 1 3.4 

428977 AK001404 Hs.194698 cyclin B2 3.4 

45 431611 U58766 Hs.264428 tissue specific transplantation antigen 3.4 

418286 AA622528 Hs.319825 Homo sapiens, done IMAGE361 6574, mRNA,3.4 

435695 AF037335 Hs.5338 carbonic anhydrase Xfl (tumor antigen H 3.4 

443378 AW392550 Hs.9280 proteasome (prosome, macropain) subunit, 3.4 

426450 NMJ14791Hs.184339 KIAA0175 gene product 3.3 

50 449571 AW016B12 Hs.200266 ESTs 3.3 

412777 A1335773 Hs.270123 ESTs 3.3 

420542 MM_000505Hs.1321 coagulation factor XII (Hageman factor) 3.3 

412754 AW1 60375 Hs.74585 amyloid beta (A4) precursor-like protein 3.3 

418327 U70370 Hs.84136 paired-like homeodomaln transcription fa 3.3 

5 5 449065 AI627393 Hs.258998 ESTs, Weakly similar to high mobility gr 3.3 

425999 AW513051 Hs,332981 ESTs, Weakly similar to I38022 hypotheti 3.3 

430280 AA361256 Hs.237868 Interleukin 7 receptor 3.3 

407777 AA161071 Hs.71465 squafene epoxldase 3.3 

426516 BE282660 Hs.170197 glutamic-oxaloacetic transaminase 2, mit 3.3 

60 414361 AI086138 Hs.204044 ESTs 3.3 

427080 AW06B287 Hs. 173468 ras-related C3 botullnum toxin substrate 3,3 

426429 X73114 Hs.169849 myosin-binding protein C, slow-type 3.3 

446163 AA0268B0 Hs.25252 prolactin receptor 3.3 

428566 U41 763 Hs.184916 clathrin, heavy polypeptlde-like 1 3.3 

65 418641 BE24313S Hs.86947 adislntegrinandmetalloprotelriasedomEi 3.3 

436293 AI601188 Hs.120910 ESTs 3.3 

411257 AA628967 Hs.115274 ESTs, Highly similar to IHHJ4UMAN INDIAN 3.3 
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430253 AK001514 Hs.236844 hypothetical protein FLJ10652 3.3 

430066 AI929659 Hs,237825 signal recognition particle 72kD 3.3 

436469 AK001455 Hs.5198 Down syndrome critical region gene 2 3.3 

437786 BE142681 Hs. 155573 polymerase (DNA directed), eta 3.3 

5 444079 H0904B Hs.23605 ESTs 3.3 

457183 H918B2 Hs.1 18569 Dvl-binding protein IDAX (Inhibition of 3.3 

431215 AA496078 Hs.121554 HumanDNAsequenrefromcloneRP11-218C13.3 

424563 AA446932 Hs.151428 ret finger protein 2 3.3 

450828 AW270C55 Hs.193804 ESTs 3.3 



10 408652 R43409 Hs.6829 Homo sapiens mRUA for KIM1644 protein, 3.3 
445142 AW978484 Hs.93842 Homo sapiens cDNA: RJ22554 fis, clone H 3.3 
426761 AI0157G9 Hs.172089 Homo sapiens mRNA; cDNA DKFZp586!2022 (f3.3 



439237 AW4D8158 Hs.318893 ESTs, Weakly similar to A47582 B-cell gr 3.3 

422616 BE300330 Hs.118725 selenophosphate synthetase 2 3.3 

15 443247 BE614387 Hs.333893 c-Myc target JP01 3.3 

406663 U24663 Hs.302063 Immunoglobulin heavy constant mi) 3.3 

434137 AA907734 Hs.124895 ESTs 3.3 

408877 AA479033 Hs.130315 ESTs, Weakly simitar to A475B2 B-cellgr 3.3 

439101 C01765 Hs.38750 hypothetical protein FLJ1 1526 3.3 

20 - 408221 AA912183 Hs.47447 ESTs 3.3 

447519 U4S25B Hs.339665 ESTs 3.3 

404755 NA Target Exon 3.3 

451871 AIB21005 Hs.118599 ESTs 3.2 

420319 AW4062B9 Hs.96583 hypothetical proteJn 3.2 



25 430580 M806105 Hs.300697 immunoglobulin heavy constant gamma 3 (G 3.2 
400202 NA NM_002795':Homo sapiens proteasome (pros3.2 



400222 NA NMJ)02082*:Homo sapiens G protein-couple 3.2 

425983 BE045897 Hs.274454 ESTs, Weakly similar to I38022 hypothec' 3,2 

458098 BE550224 Hs.74170 metallothioneln 1E (functional) 3.2 

30 430589 AJ002744 Hs.246315 UDP-N-acetyl-3lpha-D-galactosamine:polyp 3.2 

431563 AS027643 Hs.120912 ESTs 3.2 

442353 BE379594 Hs.49136 ESTs, Moderately similar to ALU7_HUMAN A 3.2 

422309 U79745 Hs.114924 solute carrier family 16 (monocarboxyllc 3.2 

4197D3 AI793257 Hs.128151 ESTs 3.2 

3 5 420380 AA640891 Hs.102406 ESTs 3.2 

410853 H04588 Hs.30469 ESTs 3.2 

454417 A1244459 Hs. 110823 trinucleotide repeat containing 9 3.2 

432745 AI821928 gb:nt78f05.x5 NCI_CGAP_Pr3 Homo sapiens 3.2 

422032 AA476966 Hs. 110857 polymerase (RNA) 111 (DNA directed) poly 3.2 

40 415339 NM_015156Hs.78398 KIAA0071 protein 3.2 

426384 AI472078 Hs.303662 ESTs 3,2 

448030 N3Q714 Hs.325960 membrane-spanning 4-domalns, subfamily A 3.2 

418739 AA310964 Hs.88012 SHP2 interacting transmembrane adaptor 3.2 

442053 R35343 Hs.24968 Human DNA sequence from clone RP1-233G163.2 

45 434747 AA837085 Hs.220585 ESTs 3.2 

427297 AW292593 Hs.334907 Homo sapiens, clone MGC:17333, mRNA, com3.2 

412228 AW5037B5 Hs.73792 complement component (3d/Epsteln Barrvl 3.2 

452304 AA025386 Hs.61311 ESTs, Weakly similar to S1059O cysteine 3.2 

453953 AW40B337 Hs.36972 CD7 antigen {p41) 3.2 

5 0 407758 D50915 Hs.3B365 KSAA0125 gene product 3.2 

451149 AL047586 Hs.10283 RNA binding motif protein 8B 3.2 

430015 AW768399 Hs. 112157 ESTs 3-2 

433313 W20128 Hs.298039 ESTs 3.2 

418334 AA319233 Hs.5521 ESTs 3.2 

5 5 450223 AA418204 Hs.241493 natural killer-tumor recognition sequenc 3.2 

454365 AW96672B Hs.54642 methionine adenosyltransferase II, beta 3.2 

451128 AL1 18568 ' gb:DKFZp761t0310_r1 761 (synonym: hamy2)3.2 

417793 AW405434 Hs.82575 small nuclear riconucleoprotein polypept 3.2 

428027 U22029 Hs.334345 cytochrome P450, subfamily HA (phanobar 3.2 

60 441197 BE244638 Hs.166 sterol regulatory element binding transc 3.2 

424634 NMJ03613Hs.1514Q7 cartilage Intermediate layer protein, mi 3.2 

4199B6 AI345455 Hs.78915 GA-binding protein transcription factor, 3.2 

416714 AF283770 Hs.79630 CD79A antigen (immunoglobulin-assoclated 3,2 

4494S5 NMJQ438QHS.23598 CREB binding protein (Rubinstein-Taybi s 3.2 

65 422166 W72424 Hs.112405 S100 calcium-binding protein A9 (calgran 3.2 

409079 W87707 Hs.82065 interleukln 6 signal transducer (gp130, 3.2 

423551 AA327598 Hs.233785 ESTs 3.2 



330 



WO 02/059377 



PCT/US02/02242 



453553 M036B49 Hs.61829 Homo sapiens cDNA FLJ1 2763 fis, clone NT 3.2 

442580 AI733682 Hs.130239 ESTs 3.2 

458079 AI796870 Hs.54277 DNA segment on chromosome X (unique) 9923.2 

425700 AF07G292 Hs.159251 forkhead box H1 3.2 

5 417124 BE122762 Hs,25338 ESTs 3.2 

407104 S57296 Hs.323910 v-erb-b2 avian erythroblastic leukemia v 3.2 

442215 AI703172 Hs,129O05 ESTs, Weakly similar to 2109260A B cell 3.1 

43Q271 T0S199 Hs.237506 DnaJ (Hsp4Q) homolog, subfamily B, membe 3.1 

425317 AW20511B Hs.210546 interieukin 21 receptor 3.1 

10 426095 AI278023 Hs.89986 ESTs 3,1 

442313 BE388898 Hs.8215 hypothetical protein FLJ1 1307 3.1 

424709 AL137589 Hs.1 52149 hypothetical protein DKFZp434K0410 3.1 

429671 BE379335 Hs.211594 proteasome (prosome, macropain) 26S subu 3.1 

432715 AA247152 Hs.2004B3 ESTs, Weakly similar to KIAA1074 protein 3.1 

15 431574 AW572659 Hs.261373 hypothetical protein dJ434014.3 3.1 

436876 A1124756 Hs.5337 isocibte dehydrogenase 2 (NADP), mitoc 3.1 

405017 NA Target Exon 3.1 

433605 AA706910 Hs,112742 ESTs 3.1 

437352 AL353957 Hs.284181 hypothetical protein DKFZp434P0531 3.1 

20 430105 X7D297 Hs.2540 cholinergic receptor, nicotinic, alpha p 3.1 

422083 NM_001141Hs.111256 arachidonate 15-lipDxygenase, second typ 3.1 

413507 BE145360 Hs.190064 ESTs, Weakly similar to I38022 hypotheti 3.1 

415989 AI267700 Hs,317584 ESTs 3.1 

422907 AI879263 Hs.6986 Human glucose transporter pseudogene 3.1 

25 425548 AA890023 Hs.1906 prolactin receptor 3,1 

422599 BE367202 Hs.118636 non-metas(a(ic cells 1, protein (NM23A) 3.1 

439963 AW247529 Hs.6793 platelet-activatlng factor acetylhydrola 3.1 

453883 AI638516 Hs.22630 cofactor required for Sp1 transcriptiona 3.1 

458021 AI885190 Hs.156089 ESTs, Weakly similar to repressor protBi 3.1 

30 418478 U38945 Hs.1174 cyclin-dependent kinase inhibitor 2A (me 3.1 

400814 NA Target Exon 3.1 

402327 Target Exon 3.1 

416935 M190712 gb:zp87f09.r1 Stratagene HeLa cell s3 93 3.1 

43983B AL355722 Hs.106875 Homo sapiens EST from clone 35214, full 3.1 

3 5 437036 AI571514 Hs.133022 ESTs 3.1 

449523 NM_000579Hs.54443 chBmokine (C-C motif) receptor 5 3.1 

406642 AJ245210 gb.'Homo sapiens mRNA for immunoglobulin 3.1 

406624 AF052762 gb:Homo sapiens clone csnegB-1 immunoglo 3.1 

421924 BE514514 Hs.1096D6 coronln, actin-binding protein, 1A 3.1 

4 0 414523 AU076633 Hs,76353 serine (or cysteine) proteinase inhlblto 3.1 

416379 N3B857 Hs.203933 ESTs 3.1 

422823 D89974 Hs.121102 vanln2 3.1 

433904 AI399956 Hs.208956 ESTs 3.1 

421904 BE1 43533 Hs,1 09309 hypothetical protein FLJ20035 3.1 

45 428834 AW899713 Hs.339315 ESTs 3.1 

436043 AW9S383B Hs.168B30 Homo sapiens cDMA FLJ12136 fis, clone MA 3.1 

452823 AB012124 Hs.30696 transcription factor-like 5 (basic helix 3.1 

405381 NA Target Exon 3.1 

428746 AW503820 Hs.1 92861 Spl-B transcription factor (SpM/PU.1 r 3.1 

50 435147 AL133731 Hs.4774 HomosaplensmRMA;cDNADKFZp761C1712(f3.1 

425782 U66468 Hs.159525 cell growth regulatory with EF-hand doma 3.1 

42330S W88562 Hs.10819a ESTs 3.1 

419123 AA234276 Hs.88253 ESTs 3.1 

4385B1 AW977756 Hs.292133 ESTs, Moderately similar to I78885 serin 3.1 

5 5 417105 X60992 Hs.81226 CD6 antigen 3.0 

428361 NM_Q15905Hs.183858 transcriptional intermsdiary factor 1 3.0 

417880 BE241595 Hs.82848 selectin L (lymphocyte adhesion molecule 3.0 

402606 NM_024626:Homo sapiens hypothetical prot 3.0 

401451 NM_004496':Homo sapiens hepatocyte nucle 3.0 

60 421878 AA299652 Hs.111496 Homo sapiens cDN'A FU1 1643 fis, clone HE 3.0 

409518 BE3B4836 Hs.3454 KIAA1821 protein 3.0 

416933 BE561B50 Hs.SO506 small nuclear ribonucleoprotein polypept 3.0 

414324 Y14768 Hs,890 lymphotoxin beta (TNF superfamily, membe 3.0 

425081 X74794 Hs.154443 minichromosame maintenance deficient (S. 3.0 

65 401519 C15000476M1^:2 ! ('ei|'P_012163.1| 3.0 

411704 AI499220 Hs.71573 hypothetical protein FLJ10D74 3.0 

42B819 AL135623 Hs.193914 KIAA0575 gene product 3.0 
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128423 AU076517 Hs.184276 solute carrier family 9 (sodium/hydrogen 3.0 

413835 AI272727 Hs,249163 fatty add hydroxylase 3.0 

412600 L26824 Hs.74101 spteen tyrosine kinase 3.0 

410491 AA465131 Hs.64001 Homo sapiens dene 25218 mRNA sequence 3.0 

43365B L0367B Hs.156110 Immunoglobulin kappa constant 3.0 

427666 AI791495 Hs.180142 calmodulln-like skin protein 3.0 
452514 AI904898 gh:RC-BT068-1 3D399-0B5 BT068 Homo sapien3.0 

429500 X78565 Hs.289114 hexabrachion {tenascin C, cytotacfirr) 3.0 

432485 N9086Q Hs.276770 CDW52 antigen {CAMPATH-1 antigen) 3.0 

437400 AB011542 Hs.5599 EGF-llke-domain, muiiiple 5 3,0 

452234 AW084176 Hs.223296 ESTs, Weakly similar to I38022 hypotheti 3.0 



413269 BE167525 gb:CM4-HT0509-08030D-107-g07 HT0509 Homo 3.0 

453216 AL137566 Hs.32405 Homo sapiens mRNA; cDNA DKFZp5B6G0321 (f 3.0 
400929 ENSP0G000252232*:Sterol regulatory eleme 3.0 



445145 AS961702 Hs.147434 ESTs 3.0 

432615 AA557191 Hs.55029 ESTs, Weakly similar to I54374 gene NF2 3.0 

423279 AW959861 Hs.290943 ESTs 3.0 

429392 AL109712 Hs,296506 Homo sapiens mRNA full length Insert cDN 3.0 

408548 AA055449 Hs.63187 ESTs, Weakly similar to ALUC_HUMAN III! 3.0 

451346 NfvL006338Hs,26312 glioma amplified on chromosome 1 protein 3.0 

413109 AW389845 Hs.110855 ESTs 3.0 

401714 NA ENSPC0000241802':CDNAFLJ11007FIS,CLON 3.0 

421462 AF016495 Hs.104624 aquaporin9 3.0 

421750 AK000768 Hs. 107872 hypothetical protein FLJ20761 3.0 

453293 AA382267 Hs.10653 ESTs 3.0 

457085 AA412446 Hs.98138 ESTs 3.0 

438930 AW843633 Hs.306163 hypothetical protein AL110115 3.0 
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TABLE 22A 

Table 22A shows the accession numbers for those pkeys lacking unigeneED's for Table 22. 
5 For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs, These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probesei identifier number 

CAT number; Gene duster number 
Accession: Genbank accession numbers 

15 

Pkey CAT number Accessions 

407980 103087 J AA046309 AI263500 M046397 

410785 1221055 1 AWB03341 AW803265 AW803403 AW803466 AW803402 AW803413 AW803268 AW803396 AW803334 AW803355 

20 411743 1256098J AW862214 AW859811 AW862215 

412136 1279172J AW895387 AW895547 AW895564 AW895323AW895405 AW895539 AW895538 

413269 1356951 1 BE1 67526 BE167651 BEQ76401 R24654 

416935 163179J AA190712 AA190665 AA252564 

422128 211994 1 AW881145M490718 M85637 AA304575T06067M331991 

25 423945 233566_1 AA410943AW948953 M334202AA332B82 

4241 OS 235506J AW40687S AW966560 AW9661 51 AW966496 AA3361 74 M335376 AA335537 

424128 236728_1 AW966163 AA335983AA336011 AA335668 M335973 

425331 250199_1 AW96212B AA355353 M427363 

426878 273265 1 BED59341 AW748403 AL044891 AI908240 AA393080 

3 0 432745 353673J AI82192S AA658826 AA564492 AA635129 1S1 

441153 51084_2 BE562826 BE375727 

448212 755Q99J AI475B5S AW969013 

451128 859865 J AL11B66B D7B823AI762176 

452514 920172J AI904898 Ai904849 AI904899 

3 5 456207 165078,-1 AA193450 
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TABLE 22B 



Table 22B shows the genomic positioning for those pkeys lacking unigene ID's and 
accession numbers in Table 22. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Pkey: Unique number corresponding to an Eos probeset 

Ref; Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I, et al," refers to the publication 

entitled The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489495. 
Stand: Indicates DNA strand from which exons were predicted. 

NLposition: Indicates nucleotide positions of predicted exons. 



Pkey 


Ref 


Strand 


reposition 


400814 


8569925 


. Minus 


72840-72924,74761-74849 


400929 


7651921 


Minus 


122033-122241,123483-124028 


401045 


8117619 


Plus 


90044-901 84.91 1 1 1-91345 


401451 


6634068 


Minus 


119926-121272 


401519 


6649315 


Plus 


157315-157S50 


401645 


7657839 


Minus 


34986-35133 


401714 


6715702 


Plus 


96484-96681 


401866 


8018106 


Plus 


73126-73623 


4D2327 


7656695 


Minus 


1 08675-108770,1 09801 -1 09910 


402359 


9211204 


Minus 


4040341961 


402408 


9796239 


Minus 


110326-110491 


402470 


9797107 


Plus 


195129-195776 


402542 


9801558 


Minus 


67076-67594 • 


402578 


9884828 


Pfus 


66350-66496 


402606 


9909429 


Minus 


81747-82094 


403011 


6693597 


Minus 


3466-3623 


403212 


7630897 


Minus 


156037-156210 


403329 


8516120 


Plus 


96450-96598 


4033^6 


8763692 


Minus 


4932349652 


403485 


9966528 


Plus 


2888-3001,3193-3532,36554117 


404347 


9838195 


Plus 


74493-74829 


404580 


6539738 


Minus 


240588-241589 


404755 


7706327 


Minus 


53729-53846 


405017 


6532084 


Plus 


35551-35690 


405348 


2914717 


Minus 


4331043462 


405381 


6006920 


Minus 


7636-8054 


405801 


2924321 


Plus 


63469-63694 


405850 


6164995 


Plus 


13871-14110 


406153 


9929734 


Minus 


12902-13069 


406346 


9255985 


Minus 


71754-71944 
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TABLE 23: 320 GENES DOWN-REGULATED IN BREAST CANCER 
COMPARED TO NORMAL ADULT BREAST 

5 Table 23 shows 320 genes down-regulated in breast cancer compared to normal adult breast. 
These were selected as for Table 22, except that the numerator was set to the median value 
for 12 non-malignant breast specimens, the denominator was set to the median value amongst 
the 73 breast cancers, the 90 th percentile value amongst the 12 non-malignant breast 
specimens was greater than or equal 80 units, and the ratio was greater than or equal to 4.0 
1 0 (i. e. 4-fold down-regulated in tumor vs. normal breast) . 



Pkey: Unique Eds probeset Identifier number 

ExAccn: . Exemplar Accession number, Genbank accession number 

UnigeneSD: Unigene number 

Unigerte Title: Unigene gene title 

R1 : Ratio of SO" 1 percentile normal body tissue to 75" percentile tumor 



Pkey ExAccn 


UnigenelD 


UnigeneTJtle 


Ratio 


428722 U76456 


Hs.1 90787 


tissue Inhibitor of meialloproteircase 4 


2? 4 


428848 NMJ100230 


Hs.1 94236 




17,4 


445263 H57646 


HsA25W 


KIAA1560 protein ^ ^ 


15.4 


418935 T284S9 


Hs.89485 


carbonic anhydrase IV 


15,0 


407228 M25079 


Hs.155376 


hemoglobin, beta 


14.6 


417511 AL049176 


Hs.82223 


chordln-like 


14.6 


439285 AL1 33916 


Hs.172572 


hypothetical protein FLJ20093 


14.3 


412442 AI983730 


Hs.26530 


serum deprivation response (phosphatidyl 


13,6 


410544 A1446543 


Hs.95511 


ESTs 


12,6 


412047 AA934589 


Hs.49696 


ESTs 


12.2 


422667 H25642 


Hs.133471 


ESTs 


12.0 


406864 L34041 


Hs.9739 


giycerol-3-phcsphate dehydrogenase 1 (so 


12.0 


423201 NM_000163 


Hs.125180 


growth hormone receptor 


11.7 


422163 AF027208 


Bs.1 12360 


prominin (mouse)-like 1 


10.6 


428769 AW207175 


Hs.106771 


ESTs 


10.6 


407049 X72632 




NM_021724*:Homo sapiens nuclear receptor 


10.1 


412295 AW088826 


Hs.1 17176 


poly(A)-blnding protein, nuclear 1 


9.B 


425126 N32759 


Hs.172944 


chorionic gonadotropin, beta polypeptide 


9.8 


406791 AI220S84 


Hs.272572 


hemoglobin, alpha 2 


9.5 


447471 AF039843 


Hs.1 8676 


sprouty (Drosophila) homolog 2 


9.5 


451533 NNL004657 


Hs.26530 


serum deprivation response (phosphatidyl 


9.4 


419407 AW410377 


Hs.41502 


hypothetical protein FLJ21276 


9.0 


411939 AI365585 


Hs.1 46246 


ESTs 


9.0 


410532 T53088 


Hs.155376 


hemoglobin, beta 


8.9 


425707 AF1 15402 


Hs.11713 


E74-like factor 5 (ets domain transcript 


8.8 


416585 X54162 


Hs,7936S 


leiomodin 1 (smooth muscle) 


8.7 


443060 D78874 


Hs.6944 


procollagen C-endopeptidase enhancer 2 


8.6 


435265 AA779958 


Hs,185932 


ESTs 


6.5 


422511 AU076442 


Hs.117938 


collagen, type XVII, alpha 1 


8.4 


433138 AB029496 


Hs.59729 


semaphorin sem2 


8.3 


402195 




NM_004437*:Homo sapiens hepatocyte nude 


6.1 


429350 A1754634 


Hs.131987 


ESTs 


8.1 


445107 A1208121 


Hs.147313 


ESTs, Weakly similar to 138022 hypoihefi 


8.1 


406643 N77976 


Hs.272572 


hemoglobin, alpha 2 


8.0 


410199 AW377424 


Hs.205126 


Homo sapiens cONA: FU22667 lis, done H 


8.0 


417225 AA815048 


Hs.24078 


hypothetical protein FLJ12649 


7.8 


437559 AA760849 


Hs.294052 


ESTs 


7.5 


436062 AK000027 


Hs.98633 


ESTs 


7.5 


425078 NIVL002599 


Hs.154437 


phosphodiesterase 2A, cGMP-stlmulated 


7.5 


43D327 AW973636 


Hs.55931 


ESTs 


7.4 
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447577 A1393693 Hs.183297 DKFZP566F2124 protein 7.4 

446039 A1150491 Hs.90756 ESTs 7.2 

422060 R20B93 Hs.325823 ESTs, Moderately similar to AIU5_HUMAN A 7.2 

424455 AA4520Q6 Hs.333199 ESTs 7.1 

5 424343 AW956360 Hs.4748 adenylate cyclase activating polypeptide 7.1 

442792 AI352340 Hs.131194 ESTs 7.0 

406714 A1219304 Hs.283108 hemoglobin, gamma G 6.9 

407571 AI44G183 Hs.9572 ESTs, Highly similar to CYA5_HU MAN ADEN Y 6.3 

429580 AA346839 Hs.209100 DKFZP434C171 protein 6,7 

10 453500 AI478427 Hs.43125 esophageal cancer related gene 4 protein 6.7 

422233 AB002058 Hs.113275 purinergic receptor P2X-»ke 1, orphan r 6.7 

420205 AA256395 Hs.88156 ESTs 6,6 

404363 NA ENSFO00QO241Q75*:TRRAP PROTEIN, 6.6 

447261 NM 006691 Hs.17917 extracellular link domaln-containiiig 1 6.5 

15 417090 AA193282 Hs.85863 ESTs, Weakly similar to B34612 zinc fing 6.5 

410677 NMJM3278 Hs.65424 tetranectin (pSasminc-gen-blnding protein 6.5 

427283 AL1 19796 Hs.174185 ectonuclestidepyrophosphatase/phosphodi 6.5 

4 1501 1 AW963085 gb:EST375158 MAGE resequences, MAGH Homo6.4 

412068 S72043 Hs.73133 metallothlonein 3 (growth inhibitory fac 6.4 

20 416253 BE250659 Hs.15463 Homo sapiens, clone IMAGE:2959994, mRNA 6.4 

435885 AA701483 Hs.36341 ESTs 6.3 

402779 NA Target Exon 6.3 

41B138 AA213626 Hs.136204 EST 6.3 

439335 AA742697 Hs.62492 ESTs, Weakly similar to B39066 prolinc-r 6.3 

25 427019 AA001732 Hs.173233 hypothetical protein FLJ10970 6,2 

411478 BE143068 gb:MR0-HT0158-03020D-O03-b09HT0158Homo6.2 

452654 BE004783 gb:MR2-BN01 14-270400-004-e1 1 BN01 14 Homo6.1 

447359 NMJ12Q93 Hs.18268 adenylate kinase 5 6.1 

414323 NM_014759 Hs.334663 K1AA0273 gene product 6.1 

30 441266 H 15968 Hs.293845 Homo sapiens, clone IMAGE:3502329, mRNA, 6.f 

417011 F08212 Hs.234898 ESTs, Weakly similar to 21 09260A B cell 6.0 

400089 NA Eos Control ■ 6.0 

433614 W07475 Hs.277101 cytochrome c oxidase subunit IV isoform 5.9 

440439 N92618 Hs.64754 ESTs, Weakly similar to potential CDS [H 5.9 

3 5 454404 BE067414 gb:MR4-BTO355-2001 00-201 -e05BT0355 Homo 5.9 

436704 AA062610 Hs.1 48050 EST 5.9 

406563 NA Target Exon 5.9 

433490 AW451023 Hs.65B48 hypothetical protein DKFZp7610132 5.9 

419313 AA843387 Hs.87279 ESTs 5.9 

40 409196 NMJ01874 Hs.334873 carboxypeptldase M 5.B 

410882 AW809163 gb:MR4-ST011 6-261099-01 2-aO3ST01 16 Homo 5.8 

453469 AB0 14533 Hs.33010 KIAA0533 protein 5.8 

441899 A1372588 Hs.8022 TU3A protein 5.8 

426210 AA372052 Hs.334559 Homo sapiens cDNA FLJ1445B tis, clone HE 5.8 

45 413065 BE063555 gb:CM1-BTD283-081199-D33-d09 BT0283 Homo 5.8 

454192 AW876B13 Hs.3343 phosphoglycerate dehydrogenase 5.7 

425187 AW014486 Hs.225D9 ESTs 5.7 

429757 AW452355 Hs.256037 ESTs 5.7 

420202 AL036557 Hs.95910 putative lymphocyte G0/G1 switch gene 5.7 

50 416284 AI695473 Hs.298006 ESTs 5.7 

428553 AA181641 Hs.184907 G protein-coupled receptor 1 5.6 

404689 NA Target Exon 5.6 

438887 R68857 Hs.265499 ESTs 5.6 

40BOB2 S47G33 Hs.82927 adenosine monophosphate deaminase 2 (iso 5.6 

5 5 449748 H23963 Hs.32043 ESTs 5.6 

431048 R50253 Hs.249129 cell death-inducing DFFA-like effector a 5.5 

452205 C15819 gb:C15819 Clontech human aorta polyA mRN 5.5 

434040 AW444613 Hs.288809 hypothetical protein FLJ20159 5.5 

407744 AB020629 Hs,38095 ATP-binding cassette, sub-family A (ABC1 5.5 

60 450606 AI668605 Hs.60380 ESTs, Moderately similar to ALU6J-IUMAN A 5.5 

414629 AA345824 Hs.76688 carboxyiesterase 1 (monocyte/macrophage 5.5 

4016S5 C110007Q3:gi|10048448[feflNPJJ65258.1|g 5.5 

436107 T99079 Hs.191194 ESTs 5.5 

444432 A116142B Hs.75916 splicing factor 3b, subunit 2, 145kD 5.5 

6 5 434715 BE005346 Hs.116410 ESTs 5.5 

447205 BE617015 Hs,11006 ESTs, Moderately similar to T17372 plasm 5.5 

408122 AI432652 Hs.42824 hypothetical protein FU1071B 5.5 
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40 



454016 AWQ16806 
414913 R25621 
459033 AA017590 
441003 BE 172240 
450637 N49826 
44239B AA994520 
403612 NA 
4D7102 AA007629 
410057 R66634 
428232 BE272452 
432769 AA620814 
431344 R99530 ' 
427032 AF012023 
406305 BE261320 
437411 AW613948 
44280Q AI809481 
402054 NA 
432085 AF212829 
415313 R59638 
459159 AI904646 
427164 AB037721 
441391 BE467930 
458959 AI285901 
402698 NA 
401810 NA 
43B879 AA827674 
414557 AA424074 
427809 M26380 
456063 NMJ06744 
451186 AW023469 
4518B2 AI821324 



455282 BE143867 
426488 X03350 
426156 BE244537 
407891 AA48662D 
408610 AW026692 
445987 D59597 
434B13 AI524307 
437526 AI076012 
454775 BE1 60229 
409451 AF012626 
409653 AW502327 
405062 

446490 AK000706 
417622 AW298163 
421978 AJ243662 
440338 R62431 
415421 R35009 
417574 R00348 
409382 AJ243191 
447998 AI766269 
445613 8E550889 
443074 AW341470 
451324 AI7B3600 
432433 AW014734 
449654 AI989812 
414519 N94587 
457531 AW973716 



427555 AW137094 
433545 MB68510 
420334 AI349351 



Hs.23310B ESTs 5.5 

gb:yh45f06.r1 Scares placenta Nb2HP Homo 5.4 

Hs.1 29907 ESTs 5.4 

Hs.126379 ESTs, Weak]/ similar to I3B022 hypolhet! 5.4 

Hs.1 8602 ESTs 5.4 
gb:ou42g09.s1 Soares_NFL_TJ3BC_S1 Homo s5.4 

Target Exon 5.3 

Hs.9739 glycerol-3-phosphate dehydrogenase 1 (so 5.3 

Hs.268107 multimerin 5.3 

Hs.183109 monoamine oxidose A 5.3 

Hs.144959 ESTs 5.3 

Hs.272572 hemoglobin, alpha 2 5.3 

Hs.173274 integrin cytoplasmic domain-associated p 5.3 

Hs.156196 transcriptional adaptor 3 (ADA3, yeast h 5,3 

Hs.194915 ESTs 5.3 

Hs.131227 ESTs 5,3 

Target Exon 5.3 

Hs.272405 potassium channel, subfamily K, member 9 5.3 

HS.61B1 ESTs 5.2 
gb:QV-8T065-020399-103 BT065 Homo saplen 5,2 

Hs.1 73871 KIAA1300 protein 5,2 

Hs.1 70381 ESTs 5.2 

Hs.1 81 297 ESTs 5.2 

ENSP00000251335*;DJ1003J2.1 (sodium and 5.2 

Target Exon 5,2 

Hs.1 89073 ESTs 5.2 

protein phosphatase 1 , regulatory (inhib 5,2 

lipoprotein lipase 5.1 

Hs.75461 retinol-binding protein 4, interstitial 5.1 

Hs.65256 ESTs, Weakly similar to leucine-rich gli 5.1 

Hs.10D445 ESTs 5,1 

NM_021620:Homo sapiens PR domain contain .5.1 

Hs.2719 HE4; WFDC2; putative ovarian carcinoma m 5.1 

Hs.126820 ESTs 5.1 

Hs.269122 ESTs, Weakly similar to ALU1_HgMAN ALU S 5.1 
gb:MR0-HT01 64-070100-013-1)02 KT0164 HomoS.1 

Hs.4 alcohol dehydrogenase 1B (class I), beta 5.1 

Hs.167382 natriuretic peptide receptor A/guanylats 5.1 

Hs,41135 endomucin-2 5.0 

Hs.224829 ESTs 5.0 

Hs.1 18821 CGl-62 protein 5.0 

Hs.1 62870 ESTs 5.0 
Hs.1213BB ESTs, Weakly similar to MDHCJHUMAN MALAT 5.0 
gb:QV1-HT0413-Q90200-062-a1 2 HT041 3 Homo 5.0 

Hs.54472 fragile X mental retardation 2 5.0 

gb:UI-HF-BROp-aka-a-07-0-Ul.r1 NIH_MGC_5 5.0 

Target Exon 5.0 

Hs.15125 hypothetical protein FLJ20699 5,0 

Hs.82318 WAS protein family, member 3 5.0 

Hs.110196 N1CE-1 protein 5.0 

Hs.12753 ESTs 5.0 

Hs,24903 ESTs 5.0 

gb:ye69e06.r1 Soares fetal liver spleen 5.0 

Hs.56874 heat shock 27kD protein family, member 7 5.0 
Hs.304389 ESTs 
Hs.1 58491 ESTs 
Hs, 144907 ESTs 
Hs.208052 ESTs 
Hs.157969 ESTs 
Hs.199850 ESTs 
Hs.55063 ESTs 
Hs.13913 K1AA1 577 protein 
Hs.192725 ESTs 

Hs.247920 cholinergic receptor, muscarinic 5 
Hs.97990 ESTs 
Hs.112495 ESTs 

Hs.118944 hypothetJcai protein FLJ22477 
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421795 X63094 
427138 N77624 
413072 BEG63965 
443721 AW450451 
408053 AW139474 
427067 M843716 
442969 AI025499 
426220 AI3B3475 
414593 BE386764 
426893 AA398716 
434046 AW292618 
401590 NA 
457971 AW134679 
427722 AK000123 
443793 M045290 
.407737 R49187 
441955 M972327 
441499 AW29B235 
447517 AI382726 
403017 

450580 N40087 
404611 H58589 
414831 M31158 
459290 NM_001545 
444341 AI142027 
408S14 AL13769B 
449638 AW204277 
434418 AF134707 
447360 AI375984 
419583 F00312 
440698 AI34B455 
451199 AI290653 
438338 NMJM4861 
433756 AW015933 
423301 S67580 
417237 H863B5 
439745 AL389981 
424137 AA335769 
449338 H73444 
434744 N94635 
407402 AF035303 
443510 NMJ11219Q 
415754 AA169114 
415986 Z43619 
457416 BE142Q52 
418064 BE3B7287 
437120 AI356125 
453950 AA156998 
401093 

436935 AW206494 
457974 AW842353 
428222 AL133112 
442705 AI264534 
437409 AL359599 
458494 AI3BO906 
410490 H03569 
416069 R37101 
438463 AA307958 
444043 AI499723 
438327 H87407 
457711 AF147401 
400870 

441425 AA933590 
416267 H45384 
407262 M12873 
444567 AV654020 



Hs.246862 
Hs.177927 
Hs.13223B 
Hs.171697 



Hs.1 42903 
Hs.101689 
Hs.182434 

Hs.15248 
Hs.35156 
Hs.77439 
Hs.34853 
Hs.146650 
Hs.46531 
Hs.250723 
Hs.278679 
Hs.167216 

Hs. 147492 
Hs.1 24758 
Hs.6168 
Hs.1 12654 
Hs.1 645 
Hs.61737 
Hs.149219 
Hs.1 6262 
Hs.394 
Hs.283828 



HS.B3384 
Hs.1 57767 
Hs.211588 

Hs.253560 
Hs.321717 
Hs.183085 
Hs.131127 
Hs.283850 
Hs.158436 



;c4,7 
4.7 
4.7 



Rhesus bleed group, D antigen 4.8 

phosphatide acid phosphatase type 2B 4.B 
gb:QV3-BT0296-140200-085-hQ1 BT0296 Homo 4.6 

ESTs 4.8 

ESTs 4.8 

ESTs 4.7 

ESTs 4.7 
ESTs, Weakly similar to T1 3924 sdk prote 
gb:601273249F1 NIH_MGC_20 Homo sapiei 



ESTs 

Target Exon 4.7 

ESTs 4.7 

hypothetical protein FLJ201 16 4.6 

ESTs, Weakly similar to 21 09260A B cell 4.6 

ESTs 4.6 

ESTs 4.6 

ESTs 4.6 

ESTs 4.6 

Tasge! Exon 4.6 

ESTs 4.5 

Homo sapiens cDNA FLJ11027 lis, done PL 4.6 

protein kinase, cAMP-dependent, regulate 4.6 

inhibitor of DNA binding 4, dominant neg 4.6 

ESTs 4.6 
Homo sapiens mRNA; cDNA DKFZp434C1915 (f4.6 

hypothetical protein MGC2747 4.6 

a disintegrin and metalloproteinase doma 4.6 

ESTs 4.6 

gb:HSBB0D101 STRATAGENE Human skeletal m4.6 

■ Homo sapiens cDNA FU11777 fis, clone HE 4.6 

ESTs 4.6 

KIAA0703 gene product 4.6 

Homo sapiens, clone MGC:9764, mRNA, comp 4.5 

cytochrome P450, subfamily IVA, polypept 4.5 

palmitoyl-protein thioesterese 2 4.5 

Homo sapiens mRNA full length insert cDN 4.5 

ESTs 4.5 

adrenomedu'iin 4.5 

Homo sapiens genomic DNA, chromosome 21q 4.5 

gb:Homo sapiens clone 23943 mRNA sequenc 4.5 

formyltetrahydrofolate dehydrogenase 4.5 

hypothetical protein FLJ1 141 3 4.5 

gb:HSC1GE121 normalized Infant brain cDN 4,5 

kringle-containing transmembrane protein 4.5 

S100 calcium-binding protein, beta (neur 4.4 
ESTs, Weakly similar to HXA2JH UMAN HOMEO 4.4 

eukaryotio translation Initiation factor 4.4 

C1200Q586*:gi|6330167|dbj|BM86477.1| {A 4.4 

ESTs . 4.4 

ESTs, Weakly similar io S22765 heterogen 4.4 

Homo sapiens mRNA; cDNA DKFZp434K098 (fr 4.4 

ESTs 4.4 
Homo sapiens mRNA; cDNA DKFZp547C126 (fr 4.4 

ESTs 4.4 

gb:yj42d08.r1 Scares placenta Nb2HP Homo 4 .4 

ESTs 4.4 

ESTs 4.4 

ESTs 4.4 

chorionic gonadotropin, beta polypeptide 4.4 

ESTs 4.3 

C11000905:gl|11692565]gb]AAG39879.1|AF28 4.3 

homeobox protein from AL590526 4.3 

gb;yn99c10.r1 Soares adult brain N2b5HB5 4.3 

gb:Human lg rearranged H-chaln mRNA VDJ4 4.3 
ESTs, Weakly simi* ' 
Target Exon 
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410034 BE087414 gb:MR4-BT0355-200100-2G1-e05BT0355Homo4.3 

456604 AI421645 Hs.139651 caveo&i2 4.3 

448427 BE395260 Hs.309438 EST 4.3 

416931 D45371 Hs.80485 adipose most abundant gene transcript 1 4.3 

5 421296 NMJJ02666 Hs,103253 perilipin 4.3 

400973 NA ENSP00000236657*:Mucln 5B (Fragment). 4.3 

452602 AW366194 Hs.55962 ESTs 4.3 

412330 NM_00510a Hs.788 A kinase (PRKA) anchor protein (gravin) 4.3 

405016 CY0Q01 71 *:gi|9260405]gb|AAF66402. 1 [AF245 4.3 

10 435104 AI475671 Hs.68607 ESTs, Highly similar to F-box protein FB 4.3 

406118 ENSP00000246632:CDNAFLJ20261 fis, done 4.3 

418556 T02850 gb:FB12A9 Fetal brain, Stratagene Homo s 4.3 

429745 AA480818 Hs.221736 ESTs 4.3 

433088 AW451206 Hs. 115899 ESTs 4,3 

15 444445 AA342329 Hs,1 15920 Homo sapiens cDNA: FLJ22816 fis, clone K 4.3 

453880 AI803166 Hs.28462 ESTs, Weakly similar to I3B022 hypottietl 4.3 

447384 AI377221 Hs.40528 ESTs 4.2 

414541 BE293116 Hs.76392 aldehyde dehydrogenase 1 family, member 4.2 

444975 AV652165 Hs.182482 ESTs, Weakly similar to T003S2 hypothec 4.2 

20 403921 NA C500O212*:gi[10047237|dbj|BAB13407.1|{A 4.2 

451477 AI798425 Hs.42710 ESTs 4.2 

4D6344 C5001660:gi|11611537|dbj|BAB18935.1](AB 4.2 

416970 AA191201 Hs.35861 DKFZP586E1621 protein 4.2 

413662 BE155866 Hs.25522 KIM 1808 protein 4.2 

25 458504 AW070634 Hs.144794 ESTs 4.2 

404682 NA C9Q01188*:gi|12738842|refjNP_073725.1| p 4.2 

418089 N69913 Hs.6858 ESTs, Weakly similar to I78885 serinerth 4.2 

403433 NM_00 1 622:Homo sapiens alpha-2-HS-glycop 4.2 

446532 AW975460 Hs.143563 ESTs 4.2 

30 414217 AI309293 Hs.279898 Homo sapiens cDNA: FLJ23165 fis, clone L 4.2 

418425 AI871247 Hs.6262 hypothetical protein MGC8407 4.2 

419589 AW973708 Hs.201925 Homo sapiens cDNA FLJ13446 fis, clone PL 4.2 

457029 M397789 Hs.161803 ESTs 4.2 

447860 AF193807 Hs.131835 Rhesus blood group, B glycoprotein 4.2 

35 44B988 Y09763 Hs.22785 gamma-amincbutyrieacid(GABA)Arecepto 4.2 

440610 AI73309B Hs,130800 ESTs 4.2 

439590 AF086410 gb:Homo sapiens full length insert cDNA 4.2 

427240 AA399975 Hs.274151 ligatin 4.2 

408932 AW594172 Hs.278513 TP53TQ3 protein 4,2 

40 436112 T77545 Hs.187559 ESTs 4.2 

4443B2 AI144152 Hs.58246 ESTs 4.2 

456716 AA31806Q Hs.135121 hypothetical protein FU224 15 4.2 

419846 NM.015977 Hs.285681 Williams-Eeuren syndrome chromosome regi 4.2 

410D36 R57171 Hs.57975 calsequestrin 2 (cardiac muscle) 4.1 

45 40D545 NA Target Exon 4.1 

403051 NA Target Exon 4.1 

420139 NMJ05357 Hs.95351 lipase, hormone-sensitive 4.1 

450244 AA007534 Hs, 125062 ESTs 4.1 

453261 AA034115 Hs.1 18494 ESTs 4.1 

50 440246 W52010 Hs.191379 ESTs 4.1 

414516 AI307802 Hs.135560 ESTs, Weakly similar to T43458 hypothefl 4.1 

438232 AI150595 Hs.122226 ESTs 4.1 

410233 AA082947 gb:zn10g07.s1 Stratagene hNT neuron (937 4.1 

412179 BE270758 Hs.69428 hypothetical protein MGC3020 4.1 

5 5 441871 Ai306150 Hs. 153450 ESTs, Weakly similar to 1909123A Na glue 4.1 

426411 AK000708 Hs,169764 hypothetical protein FU20701 4.1 

453692 AL110416 gb:DKFZp434KD431_r1 434 (synonym: htes3) 4,1 

448640 AW817177 Hs.10255B Horno sapiens, clone MGC:5352, mRMA, comp 4.1 

417481 AA203281 Hs,21798 ESTs 4,1 

60 412912 AW118878 Hs.110835 ESTs 4.1 

454183 AWB07116 gb:MR4-STG062-04010Q-O24-t)12ST0062Homo4.1 

426328 AW631296 gb;hh83c09,y1 NCI_CGAP_GU1 Homo sapiens 4.1 

435942 R06285 Hs.19'1215 ESTs 4.1 

417629 T76945 gb:yc92c07.r1 Soares Infant brain 1 NIB H 4.1 

6 5 403593 NA Target Exon 4.0 

402690 Target Exon 4.0 

418190 R49591 Hs.270425 ESTs 4.0 

339 



WO 02/059377 



PCT/US02/02242 



40B641 AW245207 


Hs.5555 


hypothetical protein MGC5347 




427899 AA829286 


Hs.332053 


serum amyloid A1 


4.0 


445975 AIB1153B 


Hs.145734 


ESTs 


4.0 


433831 BE263273 


Hs.6439 


syrapsin II 


4.0 


455578 BE006350 


Hs.14355 


Homo sapiens oDNA FU 13207 (is, clone NT 


4.0 


401640 NA 




Target Exon 


4.0 


413753 U17760 


Hs.75517 


lamlnln, beta 3 (nicein (125kD), kalinln 


4.0 


445030 AI205925 


Hs.147238 


ESTs, Highly similar to AAC3„HUMAN ALPHA 


4.0 


433873 AW15S913 


Hs.150478 


ESTs, Weakly similar to A Chain A, Crysi 


4.0 


45673B AW248217 


Hs.1619 


echaete-scuie complex (Drosophila) homol 


4.0 


450112 BE047734 


Hs.5473 


ESTs, Moderately similar to ALU5_HUMAN A 


4.0 


44B906 A1589567 


Hs.309719 


ESTs 


4.0 
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TABLE 23A 

Table 23 A shows the accession numbers for those pkeys lacking unigenelD's for Table 23. 
5 For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



Pkey; Unique Eos probeset identifier number 

CAT number Gene duster number 
Accession: Genbank accession numbers 



Pkey CAT number Accessions 



20 410034 1170594.1 

410233 118656J 

410490 1205347J 

410882 1225686 1 

411478 1247073.1 

25 413065 1347960J 

413072 1348163.1 

414593 1464909J 

414913 1506721,1 

415011 151328J 

3 0 415985 1564410J 

416267 1583547.1 

417574 16B7770.1 

417629 1690392J 

418556 1767866.-1 

3 5 4195B3 1B6198J 

'426328 264901J 

439590 47413J 

442398 541271J 

452205 9041 5_1 

40 452654 925931J 



AW502327 AW502488 AW501B29 AW5D2525 AW502687 

BE067414 BE067958 BHC67419 8E0679B3 AW577127 AW501412 

AA082947AA083036 

H03589AW750687AW750588 

AW8Q91 63 AW809247 AW809177 AW809190 AW809225 

BE143068 AWS49143 AW848705 AW848569 AW848071 AW848475 AW848092 A W64B005 
BE063555 BE151321 BE151319 BE151657 BE151655 BE063556 BE151322 

BE063965 8E0B3968 BE064034 BE064O28 BEQ63874 BE063966 BE063B69 BE064O43 BE064033 BE063884 



454404 1170594.1 

454775 1234106.1 

455282 1273020.1 

459159 919998.1 



R25621 C03959 C04010 
AW9630B5AA159005AW963073 
Z43619 R61274 H12206 R128B3 
H45384H49125 H41699 
R00348 R09593 
T76945R20210R05755 
T02850 

F00312 AA247490 F31427 AA3B3663 F22045 

AW631295AA375484 

AF086410 W94386W74609 

AA994520AW393574 

C15819AA024741AA024742 

BE004783 BE004947 A1911790 

AL110416AW876759 

AW807118 AW807569 AW807415 AW807338 AWB07288 AW807263 AWB073 16 AW 177402 AW807413 AW807068 BE141561 
BE141569AW807401 AW807310BE141555AW807318AW807119AW807299AW807241 AW807225 AW807204 AW807345 
AW807103 BE141615 AW807431 AW807393 AWB07337 AWB07406 AW807259 AWB07375 AW84589D AW8O7220 AW807399 
AW8070B4 AW807376 AW807024 BE141595 AWB07236 AW807027 AW8071 1 2 AW807378 AW807202 BE141 593 AW807216 
AW8071 38 A W807244 AW807221 AW807297 AW807050 AW807248 AW807404 AW807075 AW807237 AWB0721 2 AW807308 
AW8071 10 AW8071 04 BE 14091 2 AW807301 AW807382 AW807294 AW807026 AW807020 AW8071 08 AW807025 AW807433 
AW8071 24 AW807419 AW807031 AW807264 AW807032 AWB07029 AW807052 AW807391 AW807207 AW807215 AW80701 9 
AW807238 AW807201 8E141590 AW807302 AW807323 AW8O73B0 AW807109 BE1415B8 AW845877 AWB07418 AWB07407 
AW807309 BE141614 AW345B61 AW807396 AWB07300 AW807348 AW80731 1 AW807214 AWSQ7132 AW807402 AWSO7350 
AW807028 AW80729B AW807291 AW807305 AW807217 AW807312 AW807261 AW807268 AW807260 AW807266 AW80719B 
AW807346 AW807315 BE140940 AWBD7351 AWB07122 AW807235 AW807076 AWB07314 AW807213 AWB07143 AW807131 
AW800079AW807066AW807077AW845923AW807421 BE141619 BE140943AW807420 AW807381 BE141621 AW845921 
BE141629 BE141625 BE141624 BE141636 BE141630 AWB07405 AW807290 AW807353 
BE067414 BE06795B BE0B7419 BEB679S3 AW577127 AW601412 

BE1B0229 AW819879 AW820179 AW819882 AWB19876 AWB20169 BE153201 AW99373S BE152911 
BE143867 AW935060 AW886684 
AI904646 BE179494 BE179421 
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TABLE 23B 



Table 23B shows the genomic positioning for those pkeys lacking unigene ID'S and 
accession numbers in Table 23. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers In this column are Genbank Identifier (Gi) numbers. "Dunham I. et al." refers to the publication 

entitled The DNA sequence of human chromosome 22." Dunham I, et al., Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 

NLposltlon: Indicates nucleotide positions of predicted exons. 



Pfcey 


Ref 


Strand 


NUiosition 


400545 


9800107 


Minus 


124618-124881 


400B7Q 


9838306 


Minus 


34081-35027 


400973 


79S0452 


Minus 


98119-98253 


401093 


8516137 


Minus 


22335-23166 


401590 


9966320 


Minus 


33547-33649 


401665 


7145001 


Plus 


121591-122537 


401810 


7342191 


Plus 


129063-129476 


401840 


7684597 


Plus 


56283-56439 


402054 


8083691 




8288-8806 


402195 
402583 


76B9778 
7684486 


Minus 
Plus 


147901-148884 
94883-95003 


402690 


8348058 


Plus 


13368-13998 


402698 


8570304 


Minus 


108641-108903 


402179 


9588555 


Minus 


38173-39210 


403D17 




Pius 


78630-79367 


403051 


4827080 


Minus 


5269-5411 


4032S3 


7770677 


Plus 


52431-52737 


403433 


9719611 


Minus 


72225-72437 


403593 


6862650 


Minus 


62554-62712,69449-69602 


403612 


8469060 


Minus 


94723-94859 


403921 


7711590 


Minus 


3297-3536 


404358 


7630956 


Minus 


102053-102199 


404632 


9797231 


Minus 


40977-41150 


404689 


7534100 


Plus 


119461-119717 


405016 


6524300 


Plus 


51997-53308 


405062 


7657730 


Plus 


101283-101432 


406118 


9143818 


Plus 


53997-54629 


406344 


9255974 


Plus 


20254-20374,20526-20859,20635-21097 


406563 


7711604 


Plus 


34401-34538 
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TABLE 24: 

Table 24 depicts Seq ID No., UnigenelD, UnigeneTitle, Pkey, Pred.Cell.Loc, and ExAccn 
for all of the sequences in Table 25. The information in Table 24 is linked by Seq ID No. to 
Table 25. 



Pkey: 

ExAccn: 

UnigenelD: 

Unigene Title: 

Pred.Cell.Loc: 

Seq.ID.No.: 



Unique Eos probeset Identifier number 

Exemplar Accession number, Genbank accession number 

Unigene number 



Predicted Cellular Localization 

Sequence Identification Number found in Table 25 



Pkey ExAccn UnigenelD UnigeneTitle 



Pred.Cell.Loc. Seq. ID. Mo, 



449746 AIS68594 Hs.1765BB 
407276 AI951118 Hs.326736 
415539 AI733B81 tfs.72472 
400297 AM 27076 Hs.334473 
450375 AA009647 Hs.8850 
102457 NIVl.001394Hs.2359 
429170 NM_001394Hs.2359 
424399 AI905B87 Hs.2533 
422505 AL120862 Hs.124165 
449765 N92293 Hs.20S832 
425692 O9D041 Hs.1 55956 
426215 AW963419 Hs.1 55223 
439B4D AW44921.1 Hs.1D5445 
410102 AW24B508 Hs.278727 
429220 AW207206 Hs.136319 
416276 U41050 Hs.79136 
409079 W87707 Hs.82065 
442818 AK001741 Hs.8739 
442082 R41823 Hs,7413 
444381 SE387335 Hs.283713 
446163 AA026880 Hs.25252 
416636 N32536 Hs.42645 
442117 AW664964 Hs.128899 
433043 W57554 Hs.1 25019 
429353 AL1 17406 Hs.200102 
452190 H26735 Hs.91668 
446733 AABS3360 Hs.Z6040 
452747 BE153B55 Hs,61460 
423242 AL039402 Hs.1257B3 
417433 BE270266 Hs.82128 
432201 AI536613 Hs.298241 
423961 D13666 Hs,13634B 
439569 AW602166 Hs.222399 
114480 BE066778 Hs,151678 
404561 
325372 NA 

1122B7 ABO33064 Hs,334806 
335824 NA 

424735 U31875 Hs.272499 
4002B9 X07820 Hs.2258 
4275B5 D31152 Hs.1 79729 
429925 NK.000786HS.226213 
429441 AJ224172 Hs.204096 
421155 H87879 Hs.102267 
420931 AF044197 Hs.100431 
420813 X51501 Hs.99949 
452744 A1267652 Hs.30504 



ESTs, WeaWy similar to CP4Y_HUMAN CYTOC 
Homo sapiens breast cancer antigen NY-BR 
BMP-R1B 

hypothetical protein DKFZp56401278 

a dislntegrin and metalloprotelnasa doma 

dual specificity phosphatase 4 nuclear 

dual specificity phosphatase 4 nuclear 

aldehyde dehydrogenase 9 family, member cytoplasm 

ESTs 

ESTs, Moderately similar to ALU8_HUMAN A 

N-acetyltransferase 1 (arylamine N-acety 

stanniocalcin 2 

GDNF family receptor alpha 1 

Homo sapiens cDNA FLJ14035 fis, clone HE 

ESTs 

LIV-1 protein, estrogen regulated 
interfeukin 6 signal transducer (gp130, 
hypothetical protein FLJ10879 
ESTs 

f-Z T ', ' feakly n Eai to S640 54 hypoihefJ 
Homo sapiens cDNA FLJ13603 fis, clone PL 
solute carrier family 16 (monocarboxylic 
ESTs 

lymphoid nuclear protein (LAF-4) mRNA 
ATP-bindlng cassette transporter MRP8 
Homo sapiens clone PP1498 unknown mRNA 
ESTs, Weakiy similar to fatty acid omega 
Ig superfamily receptor LNIR 
DEME-6 protein 

5T4 oncofetal trophobiast glycoprotein 
Transmembrane protease, serine 3 
osteoblast specific factor 2 (fasciclin 
CEGP1 protein 

UDP-N-acetyl-atpha-D-galacfosamine:polyp 
NM_01 41 1 2*:Homo sapiens trichorhinophala mitochodrla 
Phase 2 & 3 Exons nuclear 
WAA1238 protein 

ENSPQQ0Q0249072':DJ222E13.1 (N-TERMiNAL 
short-chain atcotiol dehydrogenase family 
matrix metalloproteinase 10 (stromelysln 
collagen, type X, alpha 1 (Schmid metaph 
cytochrome P450, 51 (lanosterol 14-alpha ER 
lipophllln 8 (uteroglobin family member) 
iysyl oxidase extracellular 
small Inducible cytokine B subfamily (Cy 
prolactin-induced protein nuclear 
Homo sapiens mRNA; cDNA DKFZp434E082 {fr 



Seq(D1&2 
Seq ID3&4 
SeqiD5&6 
Seq ID 7 & S 
Seq ID 9 & 10 
Seq ID 11 & 12 
Seq ID 11 & 12 
Seq ID 13 & 14 
Seq ID 15 & 16 
Seq ID 17 & 18 
Seq ID 19 & 20 
Seq ID 21 & 22 
Seq ID 23 & 24 
Seq ID 25 & 26 
Seq £D 27 & 28 
Seq ID 29 & 30 
Seq iD 31 &32 
Seq ID 33 & 34 
Seq ID 35 & 36 
Seq iD 37 & 38 
Seq ID 39 & 40 
Seq ID 41 & 42 
Seq ID 43 & 44 
Seq ID 45 & 46 
Seq ID 47 & 48 
Seq ID 43 & 50 
Seq ID 51 & 52 
Seq ID 53 & 54 
Seq ID 55 & 56 
Seq ID 57 & 58 
Seq ID 59 & 60 
Seq ID 61 & 62 
Seq ID 63 & 64 
Seq ID 65 & 56 
Seq ID 67 8. 68 
Seq ID 69 & 70 
Seq ID 71 & 72 
SeqlD73&74 
Seq ID 75 S 76 
3eclD77&7B 
Seq ID 79&80 
SeqlD81 S.B2 
Seq ID 83&84 
Seq ID 85&B6 
Seq ID 87 & B8 



Seq ID 91&92 
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X7B592 Hs.99915 
NMJQ2497Hs,153704 
NM_00705QHs,225952 
AB012113 Hs.16530 
AJ245671 Hs.12344 
AA321649 Hs.2248 
W67883 Hs.13747B 
Y13153 Hs.107318 
U65011 Hs.30743 
AU077005 Hs.9220B 
X72755 Hs.77367 
W47595 Hs.169300 
L32137 HS.15B4 
AA10267O Hs.70725 
AWOS7903 Hs.82772 
AI249368 Hs.96558 
X63629 HS.2B77 
U764S6 Hs.190787 
AB026436 Hs.177534 
Y15221 Hs.1039B2 
BE3 14524 Hs.78776 
N48373 Hs.10247 
AI955040 Hs.265398 



androgen receptor {dihydrotestosterone r 
NIMA (never in mitosis gene a}-related k 
protein tyrosine phosphatase, receptor f 
small inducible cytokine subfamily A (Cy 
EGF-like-domain, multiple 6 
small inducible cytokine subfamily B (Cy 
patemaliy expressed 10 
kynurenine 3-monooxygenase (kynurenlne 3 
preferentially expressed antigen in mela 
a disiniegrin and melalloprotelnase doma 
monokine induced by gamma Interferon 
transforming growth factor, beta 1 
cartilage oiigomeric matrix protein (pse 
gamma-aminobutyric acid (GABA) A recepto 
collagen, type XI, alpha 1 
ESTs 

cadherin 3, type 1, P-cadherin (placenta 
tissue inhibitor of metalloprotelnase 4 
dual specificity phosphatase 10 
small Inducible cytokine subfamily B (Cy 
putative transmembrane protein 
activated leucocyte cell adhesion molecu 
ESTs, Weakly similar to transformations 



extraocular 



SeqID 93 5.94 
SeqID 95 5 96 



100 



101 & 102 
SeqID 103 & 104 
SeqlD 105 & 106 
SeqiD 107 & 108 
SeqlD 109 & 110 
SeqlD 111 & 112 
SeqlD 113 & 114 
SeqID 115 & 116 
SeqlD 117& 118 
Seq 10 119 & 120 
Seq ID 121 & 122 
SeqiD 123 & 124 
plasma membraneSeq ID 125 & 126 
Seq ID 127 & 128 
Seq ID 129 & 130 
Seq ID 131 & 132 
Seq ID 133 & 134 
SeqID 135 & 136 
Seq ID 137 & 138 



extracellular 
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TABLE 24A 

Table 24A shows the accession numbers for those pkeys lacking unigenelD's for Table 24. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
5 were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs, These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

1 0 Pkey: Unique Eos probeset identifier number 

CAT number: Gene cluster number 
Accession: Genbank accession numbers 

15 Pkey CAT number Accession 

33S824 CH22J197FG_619J1_UNK_E 
325372 c12_hs 
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TABLE 24B 

Table 24B shows the genomic positioning for those pkeys lacking unigene ID's and 
5 accession numbers in Table 24. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Ptey: Unique number conespanding to an Eos probeset 

1 0 Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I. et aJ." refers to the publication 

entitled "The DMA sequence of human chromosome 22." Dunham I. etal., Mature (1 999) 402:489-495. 
Strand: indicates DNA strand from which exons were predicted. 

NLposition: Indicates nucleotide positions of predicted exons, 

15 

Pkey Ref Strand NLposition 
404561 9795080 Minus 6903970100 
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Table 25 

The 69 gene sequences identified to bo overexposed in breast cancer may be used to identify coding regions from the public DNA databases (nr and htgs in 
Genbank), The sequences may be used to either identify genes that encode known proteins, or tbey may be used to predict the coding regions from genomic 
DNA using exon prediction algorithms, such as FGENESH (Salamov and Solovytv, 2000, Genome Res. 10:516-522) 

Seq ID NO: 1 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted ORF 

Coding sequence; 1-1518 (underlined sequences correspond to start and stop eadons) 
1 11 21 31 41 51 

ATGG AGCCCT CCTGGCTTCA GG AACTCATG GCTCACCCCT TCTTGCTGCT GATCCTCCTC 60 
TGCATGTCTG TGCTGCTGTT TCAGGTAATC AGGTTOTACC AGAGGAGGAG ATGGATGATC 120 
AGAGCCCTGC ACCTGTTTCC TGCACCCCCT GCCCACTGGT TCTATGGCCA CAAGGAGTTT 1 80 
TACGCAGTAA AGGAGTTTGA GGTGTATCAT AAGCTGATGG AAAAATACCC ATGTGCTGTT 240 
CCCTTGTGGG TTGGACCCTT TACGATGTTC TTCAGTGTCC ATGACCCAGA CTATGCCAAG 300 
ATTCTCCTG A AAAG ACAAGA TCCCAAAAGT OCTGTTAGCC ACA AAATCCT TG AATCCTGG 3 60 
GTTGGTCGAG GACTTGTGAC CGTGGATGQT TCTA AATGGA AAAAGCACCG CCAGATTGTG 420 
AAACCTGGCT TCAACATCAG CATTCTOAAA ATATTCATCA CCATGATGTC TGAGAGTGTT 480 
CGGATGATGC TGAACAAATG GGAGGAACGC ATTGCCCAAA ACTCACGTCT GGAGCTCTTT 540 
C AACATGTCT CCCTGATGAC CCTGGACAGC ATCATG AAGT GTGCCTTCAG CCAGCAGGGC 600 
AGCATCCAGT TGGACAGTAC CCTGGACTCA TACCTGAAAG CAGTGTTCAA CCTTAGCAAA 660 
ATCTCCAACC AGCGCATGAA CAATTTTCTA CATCACAACG ACCTGGTTTT CAAATTCAGC 720 
TCTCAAGGCC AAATCTTTTC TAAATTTAAC CAAGAACTTC ATCAGTTCAC AGAGAAAGTA 780 
ATCCAGGACC GGAAGGAGTC TCTTAAGGAT AAGCTAAAAC AAGATACTAC TCAGAAAAGG 840 
CGCTGGGATT TTCTGGACAT ACTTTTGAGT GCCAAAAGCG AAAACACCAA AGA'ITTCTCT 500 
GA 3GAGAT 1C_AI i 1 ! AfiTGA/ 1 .CG TTt Ti ITTG C ICATCA C Z C/ rCX 
AGTGCTATCT CCTGGATCCT TTACTGCTTG GCAAAGTACC CTGAGCATCA GCAGAGATGC 1020 
CGAGATGAAA TCAGGGAACT CCTAGGGGAT GGGTCTTCTA TTACCTGGGA ACACCTGAGC 1080 
CAGATGCCTT ACACCACGAT GTGCATCAAG GAATGCCTCC GCCTCTACGC ACCGGTAGTA 1140 
AACATATCCC GGTTACTCGA CAAACCCATC ACCTTTCCAG ATGGACGGTC CTTACCTGCA 1200 
GGAATAACTG TGTTTATCAA TATTTGGGCT CTTCACCACA ACCGCTATTT CTGGGAAGAC 1260 
CCTCAGGTCT TTAACCCCTT GAGATTCTCC AGGGAAAATT CTGAAAAAAT ACATCCCTAT 1320 
GCCTTCATAC CATTCTCAGC TGGATTAAGG AACTGCATTG GGCAGCATTT TGCCATAATT 1380 
GAGTGTAAAG TGGCAGTGGC ATTAACTCTG CTCCGCTTCA AGCTGGCTCC AGACCACTCA 1440 
AGGCCTCCCC AGCCTGTTCG TCAAGTTGTC CTCAAGTCCA AGAATGGAAT CCATGTGTTT 1500 
GCAAAAAAAG TTTGCTAATT TTAAGTCCTT TCGTATAAGA ATTAATGAGA CAATTTTCCT 1560 
ACCAAAGGAA GAACAAAAGG ATAAATATAA TACA AAATAT ATGTATATGG TTGTTTGACA 1620 
AATTATATAA CTTAGGATAC TTCTGACTGG TTTTGACATC CATTAACAGT AATTTTAATT 1680 
TCTTTGCTGT ATCTGGTGAA ACCCACAAAA ACACCTGAAA AAACTCAAGC TGACTTCCAC 1740 
TGCGA AGGG A AATTATTGGT TTGTGTAACT AGTGGTAG AG TGGCTTTCAA GC ATAGTTTG 1 800 
ATCAAAACTC CACTCAGTAT CTGCATTACT TTTATCTCTG CAAATATCTG CATG ATAGCT 1 860 
TTATTCTCAG TTATCTTTCC CCAATAATAA AAAA 



Seq ID NO: 2 Prolcin sequence: 

Protein Accession Ik FGENESH predicted 

1 11 21 31 41 51 

MEPSWLQBLM AHFFLLLILL CMSLLLFQVI K1YQRRRV/MI RALHLFPAPP AHWFYGHKEF 60 
YPVKEFEVYH KXMEKYPCAV PLWVGPFTMF FSVHDPDYAK ILLKRQDPKS AVSHKILESW 120 
VGRGLVTLDG SKWKKHRQIV KPGFNTSILK 1F1TMMSESV RMMLNKWEER IAQNSRLELF 1 80 
QHVSLMTLDS IMKCAFSHQG SIQLDSTLDS YLKAVFNLSK ISNQRMNNFL HHNDLVFKFS 24C 
SQGQIFSKFN QELHQFTEKV [QDRKESLKD KLKQDTTQKR RWDFLDILLS AKSENTKDFS 3 00 
EADLQAEVKT FMFAGHDTTS SAISW1LYCL AKYPEHQQRC RDEIRELLGD GSS1TWEHLS 360 
QMPYTTMCK ECLRLYAPVV NISRLLDKPI TFPDGRSLPA GITVFINIWA LHHNPYFWED 420 
PQVFNPLRFS RENSEKIHPY AFIPFSAGLR NCIGQHFAII ECKVAVALTL LRFKL APDHS 480 
RPPQPVRQVV LKSKNGIHVF AKKVC 

Seq ID NO: 3 DNA sequence 

Nucleic Acid Accession it: NM_052997 

Coding sequence: 1004125 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

CTAGTCTATA CCAGCAACGA CTCCTACATC GTCCACTCTG GGGATCTTAG AAAGATCCAT 60 
AAAGCTGCCT CCCGGGGACA AGTCCGGAAG CTGGAGAAG A TGA CAAAGAG GAAGAAGACC 120 
ATCAACCTTA ATATACAAGA CGCCCAGAAG AGGACTGCTC TACACTGGGC CTGTGTCAAT 130 
GGCCATGAGG AAGTAGTAAC ATTTCTGGTA GACAGAAAGT GCCAGCTTGA CGTCCTTGAT 240 
GGCG A ACACA GG ACACCTCT G ATG AAGGCT CTACAATGCC ATCAGGAGGC TTGTGCAAAT 300 
ATTCTGATAG ATTCTGGTGC CGATATAAAT CTCGTAGATG TGTATGGCAA CATGGCTCTC 360 
CATTATGCTG TTTATAGTGA GATTTTGTCA GTGGTGGCAA AACTGCTGTC CCATGGTGCA 420 
GTCATCG AAG TGCACAACA A GGCTAGCCTC AC ACCACTTT TACTATCCAT AACG AAAAGA 480 
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AGTGAGCAAA TTOTGGAATT TTTGCTGATA AAAAATGCAA ATGCGAATGC AGTTAATAAG 54D 
TATAAATGCA CAGCCCTCAT GCTTGCTGTA TOTCATGGAT CATCAGAGAT AGTTGGCATG 600 
CTTCTTCAGC AAAATGTTGA CGTCTTTGCT GCAGATATAT GTGGAGTAAC TGCAGAACAT 660 
TATGCTGTTA CTTGTGGATT TCATCACATT CATGAACAAA TTATGGAATA TATACGAAAA 720 
TTATCTAAAA ATCATCAAA A TACCAATCCA GAAGG AACAT CTGCAGG A AC ACCTGATGAG 780 
GCTGCACCCT TGGCGGAAAG AACACCTGAC ACAGCTGAAA GCTTGGTGGA AAAAACACCT 840 
GATGAGGCTG CACCCTTGGT GGAAAGAACA CCTGACACGG CTGAAAGCTT GGtGGAAAAA 900 
ACACCTGATG AGGCTGCATC CTTGGTGGAG GGAACATCTG ACAAAATTCA ATGTTTGGAG 960 
AAAGCGACAT CTGGAAAGTT CGAACAGTCA GCAGAAGAAA CACCTAGGGA AATTACGAGT 1020 
CCTGCAAAAG AAACATCTGA GAAATTTACG TGGCCAGCAA AAGGAAGACC TAGGAAGATC 1080 
GCATGGGAGA AAAAAGAAGA CACACCTAGG GAAATTATGA GTCCCGCAAA AGAAACA'I CT 1 1 40 
GAGAAATTTA CGTGGGCAGC AAAAGU \ G T i C A «AA 

ACACCTGTAA AGACTGGATG CGTGGCAAGA GTAACATCTA ATAAAACTAA AGTTTTGGAA 1260 
AAAGGAAGAT CTAAGATGAT TGCATGTCCT ACAAAAGAAT CATCTACAAA AGCAAGTGCC 1320 
AATGATCAGA GGTTCCCATC AGAATCCAAA CAAGAGGAAG ATGAAGAATA TTCTTGTGAT 1 3 SO 
TCTCGGAGTC TCTTCGAG AG TTCTGCAAAG ATTCAAGTGT GTATACCTGA GTCTATATAT 1440 
CAAAAAGTAA TGGAGATAAA TAGAGAAGTA GAAGAGCCTC CTAAGAAGCC ATCTGCCTTC 1500 
AAGCCTGCCA TTGAAATGCA AAACTCTGTT CCAAATAAAG CCTTTGAATT GAAGAATGAA 1S60 
CAAACATTGA GAGCAGATCC GATGTTCCCA CCAGAATCCA AACAAAAGGA CTATGAAGAA 1620 
AATTCTTGGG ATTCTGAGAG TCTCTGTGAG ACTGTTTCAC AGAAGGATGT GTGTTTACCC 1680 
AAGGCTACAC ATCAAAAAGA AATAGATAAA ATAAATGGAA AATTAGAAGA GTCTCCTAAT 1740 
A AAOATGGTC TTCTGAAGGC TACCTGCGGA ATG AAAGTTT CTATTCCAAC TAAAGCCTTA 1 800 
G AATTGAAGG ACATGCAAAC TTTCAAAGCG G AGCCTCCGG GG AAGGCATC TGCCTTCG AG 1860 
CCTGCCACTG AAATGCAAAA GTCTGTCCCA AATAAAGCCT TGGAATTGAA AAATGAACAA 1920 
ACATGGAGAG CAG ATG AG AT ACTCCCATCA GAATCCAAAC AAAAGGACTA TGAAGAAAAT 1980 
TCTTGGGATA CTGAGAGTCT CTGTGAGACT GTTTCACAGA AGGATGTGTG TTTACCCAAG 2040 
GCTGCGCATC AAAAACiAAAT AGATA AA ATA AATCGAAAAT TAGAAGGOTC TCCTGTTAAA 7 1 03 
GATGGTCTTC TGAAGGCTAA CTGCGG AATG AAAtnTl C 1'A TTCCAACTAA AGCCTTAG AA 2 1 60 
TTGATGGACA TGCAAACTTTCAAAGCAGAG CCi'CC VGA AC I'CTGC CTTCGACiCCT 22 
GCCATTGAAA TGCAAAAGTC TGTTCCAAAT AAAGCCTTGG AATTGAAGAA TGAACAAACA 2280 
TTGAGAGCAG ATGAGATACT CCCATCAGAA TCCAAAC AAA AGGACTATGA AGAAAGTTCT 2340 
TGGGATrCTG AGAGTCTCTG TGAGACTGTT TCACAGAAGG ATGTGTGTTT ACCCAAGGCT 2400 
ACACATCAAA AAGAAATAGA TAAAATAAAT GGAAAATTAG AAGAGTCTCC TGATAATGAT 2460 
GGTTTTCTGA AGGCTCCCTG CAGAATGAAA GTTTCTATTC CA ACTAAAGC CTTAGAATTG 2520 
ATGGACATGC AAACTTTCAA AGCAGAGCCT CCCGAGAAGC CATCTGCCTT CGAGCCTGCC 2580 
ATTG A AATGC AAAAGTCTGT TCCAAATAAA GCCTTG GAAT TG AAGA ATG A ACAA ACATTG 2640 
AGAGCAGATC AGATGTTCCC TTCAGAATCA AAACAAAAGA AGGTTGAAGA AAATTCTTGG 2700 
GATTCTGAGA GTCTCCGTGA GACTGTTTCA CAGAAGGATG TGTGTGTACC CAAGGCTAGA 2760 
CATCAAAAAG AAATGGATAA AATAAGTGGA AAATTAGAAG ATTCAACTAG CCTATCAAAA 2820 
ATCTTGG ATA CAGTTCATTC TTGTG AAAG A GCA AGGGAAC TTCAAA AAG A TCACTGTGAA 2880 
CAACGTACAG GAAAAATGGA ACAAATGAAA AAGAAGTTTT GTGTACTGAA AAAGAAACTG 2940 
TCAGAAGCAA AAGAAATAAA ATCACAGTTA GAGAACCAAA AAGTTAAATG GGAACAAGAG 3000 
CTCTGCAGTG TGAGATTGAC TTTAAACCAA GAAGAAGAGA AGAGAAGAAA TGCCGATATA 3060 
TTAAATGAAA AAATTAGGGA AGAATTAGGA AGAATCGAAG AGCAGCATAG GAAAGAGTTA 3120 
GAAGTGAAAC AAOAACTTGA ACAGGCTCTC AGAATACAAG ATATAfiAATT GAAGAGTGTA 3180 
GAAAGTAATTTUAATCAGGTTTCTCACACTCATOAAAA'IGAA ATTA'l 1 H .C VT3AA 3240 
AATTG CATGT TG A AA AAGG A AATTGCCATG CTAAAACTGG AAATAGCCAC ACTGAAACAC 3300 
CAATACCAGG AAAAGGAAAA TAAATACTTT GAGGACATTA AGATTTTAAA AGAAAAGAAT 3360 
GCTGAACTTC AGATGACCCT AAAACTGAAA GAGGAATCAT TAACTAAAAG GGCATCTCAA 3420 
TATAGTGGGC AGCTTAAAGT TCTGATAGCT G AGAACACAA TGCTCACTTC TAAATTGAAG 34S0 
GAAAAACAAG ACAAAGAAAT ACTAGAGGCA GAAATTGAAT CACACCATCC TAGACTGGCT 3540 
TCTGCTGTAC AAGACCATGA TCAAATTGTG ACATCAAGAA AAAGTCAAGA ACCTQCTTTC 3600 
CACATTGCAG GAGATGCTTG TTTGCAAAGA AAAATGAATG TTGATGTGAG TAGTAGGATA 3660 
TATAACAATG AGGTGCTCCA TCAACCACTT TCTGAAGCTC AAAGGAAATC CAAAAGCCTA 3720 
AAAATTAATC TCAATTATGC AGGAGATGCT CTAAGAGAAA ATACATTGGT TTCAGAACAT 3780 
GCACAAAGAG ACCAACGTGA AACACAGTGT CAAATGAAGG AAGCTGAACA CATGTATCAA 3840 
AACGAACAAG ATAATGTGAA CAAACACACT GAACAGCAGG AGTCTCTAGA TCAGAAATTA 390D 
TTTCAACTAC AAAGCAAAAA TATGTG GCTT CAACAGCAAT TAGTTC ATGC AC ATA AG AAA 396D 
GCTGACAACA AAAGCAAGAT AACAATTGAT ATTCATTTTC TTGAGAGGAA AATGCAACAT 4020 
CATCTCCTAA AAGAGAAAAA TGAGGAGATA TTTAATTACA ATAACCATTT AAAAAACCGT 4080 
ATATATCAAT ATGAAAAAGA GAAAGCAGAA ACAGAAAACT CAJOAGAG AC AAGCAGTAAG 4140 
AAACTTCTTT TGGAGAAACA ACAGACCAGA TCTITACTCA CAACTCATGC TAGGAGGCCA 4200 
GTCCTACCAT CACCTTATGT TGAAAATCTT ACCAATAGTC TGTGTCAACA GAATACTTAT 4260 
TTTAGAAGAA AAATTCATGA TTTCTTCCTG AAGCCTACAG ACATAAAATA ACAGTGTGAA 4320 
GAATTACTTG TTCACGAATT GCATAAAGCT GCACAGGATT CCCATCTACC CTGATGATGC 4380 
AGCAGACATC ATTCAATCCA ACCAGAATCT CGCTCTGCAC TCCAGCCTAG GTOACAGAGT 4440 
GAGACTCCAC CTCGGAAA 



SeqlD NO: 4 Proteins 



Protein Aqcession ft NP 443723.1 



MTKRKKTINL MQDAQKRTA LHWACVNOHE EV VTFLVDRK. CQLDVLDGEH RTPLMKALQC 60 
HQEACANILt DSGADMLVD VYGNMALHYA VYSE1LSVVA "J.L HGAVI h L'l'PL 1 U 
LLS1TKRSEQ IVEFLLIKNA NANAVNKYKC TALMLAVCHG SSEIVGMU.Q QNVDVFAADI 180 
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CGVTAEHYAV TCGFHHIHEQ IMEY1RKLEK NHQNTNPEGT SAGTPDEAAP LAERTPDTAE 240 
SLVEKTPDEA APLVERTPDT AESLVEKTPD EAASLVEGTS DKIQCLEKAT SQKFEQSAEE 300 
TPRE1TSPAK ETSEKFTWFA KGRPEKIAWE KKEDTPREIM SPAKETSEKF TWAAKGRPRK 360 
1AWEKKETPV KTGCVARVTS NKTK.VLEKGR SKMIACPTKE SSTKASANDQ RFPSESKQEE 420 
5 UEEYSCDSRS LFESSAKIQV CIPESIYQKV MEINREVEEP PKKPSAFKPA IBMQNSVPNK 4S0 

AFELKNEQTL RADPMFPPES KQKDYEENSW DSESLCETVS QKDVCLPKAT HQKEIDK1NG 540 
fCLEESPNKDG LLKATCGMKV SIPTKALELK DMQTFKAEPP GKPSAFEPAT EMQKSVPNKA 600 
LELKNEQTWR ADEILPSESK QKDYEENSWD TESLCETVSQ KDVCLPKAAH QKEiDKTNGK 660 
LEGSPVKDGL LKANCGMKVS IPTKALELMD MQTFKAEPPE KPSAFEPAIE MQKSVPNKAL 720 
1 0 ELKNEQT1.P.A DEIU'SESKQ KDYBESSWDS ESLCRTVSQK DVCLPKATHQ KUIDKINGKL 780 
EESPDNDGFLKAPCKMRVSIITKALELMDM QTFKAEPPHKPSAFEPAIRMQKSVPNKALE 840 

l r n i o fpse t 1 bnswi h i kathqkemi gku 

DSTSLSKJLD TVHSCERARE LQKDHCEQRT GKMEQMKKKF CVLKKKLSEA KEtKSQLENQ 960 
KVSCWEQELCS VRLTLNQEEE KRRNAD1LNE KIREELGRIE EQHRKELEVK QQLEQALRIQ 1020 

1 5 DJELKS VESN LNQVSHTHEN ENYLLHENCM LKKEIAMLKX EIATLKHQYQ EKENKYFEDI 1 080 
KiLKEKNABL QMTLKLKEES LTKRASQYSG QLKVLIAENT MLTSKXKEKQ DKEILEAEIE ! 140 
SHHPRLASAV QDHDQIVTSR KSQEPAFHIA GDACLQRKMN VDVSSTIYNN EVLHQPLSEA 1200 
QRKSKSLKIN LNYAGDALRE NTLVSEHAQR DQRETQCQMK EAEHMYQNEQ DNVNKHTEQQ 1260 
ESLDQKLFQL QSKNMWLQQQ LVHAHKKADN KSK!TIDIHF LERKMQHHLL KEKNEEtFNY 1320 

20 NNHLKNRIYQ YEKEKAETEN S 

Seq ID NO: 5 DNA sequence 

Nucleic Acid Accession ft noiie found 

Coding sequence: 273-1 785 (underlined sequences correspond to start and stop codons) 



1 It 21 31 41 51 

CGCGGGGCGC GGAGTCGGCG GGGCCTCGCG GGACGCGGGC AGTGCGGAGA CCGCGGCGCT 60 

30 GAGGACGCGG GAGCCGGGAG CGCACGCGCG GGGTGGAGTT CAGCCTACTC TTTCTTAGAT 120 
GTGAAAGGAA AGG AAG ATCA TTTCATGCCT TGTTG AT AAA GGTTCAGACT TCTGCTGATT 1 80 
CATAACCATT TGGCTCTGAG CTATGACAAG AGAGGAAACA AAAAGTTAAA CTTACAAGCC 240 
TGCCATAAGT GAGA AGCAAA CTTCCTTGAT AACATGCTTT TGCGAAGTGC AGGAAA ATT A 300 
AATGTGGGCA CCAAGAAAGA GGATGGTGAG AGTACAGCCC CCACCCCCCG TCCAAAGGTC 360 

3 5 TTGCGTTGTA AATGCCACCA CCATTGTCC A GAAGACTC AG TCAACAATAT TTGCAGCACA 420 
GACGGATATT GTTTCACGAT GATAGAAGAG GATGACTCTG GGTTGCCTGT GGTCACTTCT 4SQ 
GGTTGCCTAG GACTAGAAGG CTCAGATTTT CAGTGTCGGG ACACTCCCAT TCCTCATCAA 540 
AGAAGATCAA TTGAATGCTG CACAGAAAGG AACGAATGTA ATAAAGACCT ACACCCTACA 600 
CTGCCTCCAT TGAAAAACAG AGATTTTGTT GATGGACCTA TACACCACAG GGCTTTACTT 660 

40 ATATCTGTGA CTGTCTGTAG TTTGCTCTTG GTCCTTATCA TATTATTTTG TTACTTCCGG 720 

TATAAAAGAC AAGAAACCAG ACCTCGATAC AGCATTGGGT TAG AAGAGGA TGAAACTTAC 780 
ATTCCTCCTG GAGAATCCCT GAGAGACTTA ATTGAGCAGT CTCAGAGCTC AGGAAGTGGA 840 
TCAGGCCTCC CTCTGCTGGT CCAAAGGACT ATAGCTAAGC AGATTCAGAT GGTGAAACAG 900 
ATTGGAAAAG GTCGCTATGG GGAAGTTTGG ATGGGAAAGT GGCGTGGOGA AAAGGTAGCT 960 

45 GTGAAAGTG'l f( FTC VCCAC AGAGO .AOCC \G< TGG'ITCA < iGAGAC \i A \ATA1 AT kG 1020 
ACAGTGTTGA TGAGGCATGA AAACATTTTG GGTTTCATTG CTGCAGATAT CAAAGGGACA 1080 
GGGTCGTGGA CCCAGTTGTA CCTAATCACA GACTATCATG AAAATGGTTC CCTTTATGAT 1 ! 40 
TATCTGAAGT CCACCACCCT AGACGCTAAA TCAATGCTGA AGTTAGCCTA CTCTTCTGTC 1 200 
AGTGGCTTAT GTCATTTACA CACAGAAATC TTTAGTACTC AAGGCAAACC AGCAATTGCC 1260 

5 0 CATCG AGATC TG AAAAGTA A AAACATTCTG GTGAAGAAAA ATGG AACTTG CTGTATTGCT 1 320 
GACCTGGGCC TGGCTGTTAA ATTTATTAGT GATACAAATG AAGTTGACAT ACCACCTAAC 1380 
ACTCGAGTTG GCACCAAACG CTATATOCCT CCAGAAGTGT TGGACGAGAG CTTGAACAGA 1440 
AATC ACTTCC AGTCTTACAT CATGGCTGAC ATGTATAGTT TTGGCCTCAT CCTTTGGGAG 1 500 
GTTGCTAGGA GATGTGTATC AGGAGGTATA GTGGAAGAAT ACCAGCTTCC TTATCATGAC 1560 

5 5 CTAGTGCCCA GTGACCCCTC TTATGAGGAC ATGAGGGAGA TTGTGTGCAT CAAGAAGTTA 1620 
COCCCCTCAT TCCCAAACCG GTGGAGCAGT GATGAGTGTC TAAGGCAGAT GGGAAAACTC 1680 
ATGACAGAAT GCTGGGCTCA CAATCCTGCA TCAAGGCTGA CAGCCCTGCG GGTTAAGAAA 1740 
ACACTTGCCA AAATGTCAGA GTCCCAGGAC ATTAAACTCT GATAGGAGAG GAAAAGTAAG 1800 
CATCTCTGCA GAAAGCCAAC AGGTACTCTT CTGTTTGTGG GCAGAGCAAA AGACATCAAA 1860 

60 TAAGCATCCA CAGTACAAGC CTTGAACATC GTCCTGCTTC CCAGTGGGTT CAGACCTCAC 1920 
CTTTCAGGG A GCGACCTGGG CAAAGACAGA GAAGCTCC'JCA GAAGGAGAGA TTGATCCGTG 1980 
CTGTTTG'l - <-C,i, G „ CCC TG' 1 \ AGTTGTTCAA GATATGATGC AT 



65 Seq ID NO: 6 Protein sequence: 

Protein Accession ft none found 



1 11 21 31 41 51 

70 i I l I l l 

■ MIXRSAGKLN VGTKKEDGES TAPTPRPKVL RCKCHHHCPE DSVNNICSTD GYCFTMIEED 60 
DSGLPVVTSG CLGLEGSDFQ CRDTPIPHQR RSIECCTEBN ECNKDLHPTL PPLKNRDFVD 120 
GPIHHRALLI SVTVCSLLLV LIILFCYFRY KRQETRPRYS IGLEQDETYI PPGESLRDLI 1 80 
EQSQSSGSGS GLPLLVQRT1 AKQIQMVKQI GKGRYGEVWM GKWRGEKVAV KVFFTTEEAS 240 
75 WFRETEIYQT VLMRHENILG FIAADKGTG SWTQLYLITD YHENGSLYDY LKSTTLDAKS 300 
MLKLA YSSVS GLCHLHTEIF STQGKPAIAH RDLKSKNILV KKNGTCCIAD LGLAVKF1SD 360 
TNEVD1PPNT RVGTKRYMPP EVLDESLNRN HFQSYIMADM YSFGLILWEV ARRCVSGGIV 420 
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EEYQLPYHDL VPSDPSYEDM REIVCIKKLR PSFPNRWSSD ECLRQMOKLM TECWAHNFAS 4SO 
RLTALRVKKT LAKMSESQDI KL 

Seq ID NO: 7 DNA sequence 

Nucleic Acid Accession ft none found 

Coding sequence: 4S2-3007(un[Ierli!icd sequences correspond to start and slop codons) 



AACYGAGCTA ACAAGAAATA CTAGAAAAGG AGGAAGGAGA ACATTGCTGC AGCTTGGATC 60 
TACAACCTAA GAAAGCAAGA GTGATCAATC TCAGCTCTGT TAAACATCTT 01 TTACTTAC 120 
TGCATTCAGC AGCTTGCAAA TGGTTAACTA TATGCAAAAA AOTCAOCATA GCTGTGAAGT 180 
ATGCCGTGAA TTTTAATTGA GGGAAAAAGG GACAATTGCT TCAGGATGCT CTAGTATGCA 240 
CTCTGCTTGA AATATTTTCA ATGAAATGCT CAGTATTCTA TCTTTGACCA GAGGTTTTAA 300 
CTTTATGAAG CTATGGGACT TGACAAAAAG TGATATTTGA GAAGAAAGTA CGCAGTGGTT 360 
GGTGTTTTCr TTTTTTTAAT AAAGGAATTG AATTACTCTG AACACCTCTT CCAGCTGTGC 420 
ATTACAGATA ACGTCAGGAA GAGTCTCTGC TTTACAGAAT CGGATTTCAT CACATGACAA 4SO 
CATGAAGCTG TGGATTCATC TCTTTTATTC ATCTCTCCTT GCCTGTATAT CTTTACACTC 540 
CCAAACTCCA GTGCTCTCAT CCAGAGGCTC TTGTGATTCT CTTTGCAATT GTGAGGAAAA 600 
AGATGGCACA ATGCTAATAA ATTGTGAAGC AAAAGGTATC AAGATGGTAT CTGAAATAAG 660 
TGTGCCACCA TCACGACCTT TCCAACTAAG CTTATTAAAT AACGGCTTGA CGATGCTTCA 720 
CACAAATGAC TTTTCTGGGG TTACCAATGC TATTTCAATA CACCTTGGAT TTAACAATAT 780 
TGCAGATATT GAGATAGGTG CATTTAATGG CCTTGGCCTC CTGAAACAAC TTCATATCAA 840 
TCACA ATTCT TTAGAAATTC TTAAAGAGGA TACTTTCCAT GGACTGG AAA ACCTGG AATT 900 
CCTGCAAGCA GATAACAATT TTATCACAGT GATTGAACCA AGTGCCTTTA GCAAGCTCAA 960 
CAGACTCAAA GTGTTAATTT TAAATGACAA TGCTATTGAG AGTCTTCCTC CAAACATCTT 1020 
CCOi ATTTOIT CC 1TTAACCC ATCTAGATCT TCGTGGAAAT CAATTACAAA CATTGCCTTA 1080 
TOTTGtjTl I T CTCGAACACA TTGGCCGAAT ATrGGATCTT CAGTTGGAGG ACAACAAATG 1 140 
GGCCTGCAAT TGTGACTTAT TGCAGTTAAA AACTTGGTTG GAGAACATGC CTCCACAGTC 1200 
TATAATTGGT GATGTTGTCT GCAACAGCCC TCCATTTTTT AAAGGAAGTA TACTCAGTAG 1260 
ACTAAAGAAG GAATCTATTT GCCCTACTCC ACCAGTGTAT CAAGAACATG AGGATCCTTC 1320 
AGGATCATTA CATCTGGCAG CA ACATCTTC AATAAATGAT AGTCGCATGT CAACTAAG AC 1 3 SO 
CACGTCCATT CTAAAACTAC CCACCAAAGC ACCAGGTTTG ATACCTTATA TTACAAAGCC 1440 
ATCCACTCAA CTTCCAGGAC CTTACTGCCC TATTCCTTGT AACTGCAAAG TCCTATCCCC 1500 
ATCAGGACTT CTAATACATT GTCAGO AGCG CAACATTGA A AGCTTATCAG ATCTGAGACC 1 560 
TCCTCCGCAA AATCCTAGAA AGCTCATTCT AGCGGGAAAT ATTATTCACA GTTTAATGAA 1620 
GTCTGATCTA GTGGAATATT TCACTTTGGA AATGCTTCAC TTGGGAAACA ATCGTATTGA 1680 
AGTTCTTGAA GAAGGATCGT TTATGAACCT A ACGAOATTA CAAAAACTCT ATCTAAATGG 1740 
TAACCACCTG ACCAAATTAA GTAAAGGCAT GTTCCTTGGT CTCCATAATC TTGAATACTT 1800 
ATATCTTGAA TACAATGCCA TTAAGGAAAT ACTGCCAGGA ACCTTTAATC CAATGCCTAA 1860 
ACTTAAAGTC CTGTATTTAA ATAACAACCT CCTCCAAGTT TTACCACCAC ATATTTTTTC 1920 
AGGGGTTCCT CTAACTAAGG TAAATCTTAA AACAAACCAG TTTACCCATC TACCTGTAAG 1980 
TAATATTTTG GATGATCTTG ATTTACTAAC CCAGATTGAC CTTGAGG ATA ACCCCTGGGA 2040 
CTGCTCCTGT GACCTGGTTG GACTGCAGCA ATGGATACAA AAGTTAAGCA AGAACACAGT 2100 
G ACAGATGAC ATCCTCTGCA CTTCCCCCGG GCATCTCG AC A A A A AGG A AT TGAAAGCCCT 2 1 60 
AAATAGTGAA ATTCTCTGTC C AGGTTTAGT A A ATA ACCCA TCCATGCC A A CAC AGACTAG 2220 
TTACCTTATG GTCACCACTC CTGCAACAAC AACAAATACG GCTGATACTA TTTTACGATC 2280 
TCTTACGGAC GCTGTGCCAC TGTCTGTTCT AATATTGGGA CTTCTGATTA TGTTCATCAC 2340 
TATTGTTTTC TGTGCTGCAG GGATAGTGGT TCTTGTTCTT CACCGCAGGA GAAGATACAA 2400 
AAAGAAACAA GTAGATGAGC AAATGAGAGA CAACAGTCCT GTGCATCTTC AGTACAGCAT 2460 
GTATGGCCAT AAAACCACTC ATCACACTAC TGAAAGACCC TCTGCCTCAC TCTATGAACA 2520 
GCACATGGTG AGCCCCATGG TTCATGTCTA TAGAAGTCCA TCCTTTGGTC CAAAGCATCT 2580 
GGAAGAGGAA GAAGAGAGGA ATGAGAAAGA AGGAAGTGAT GCAAAACATC TCCAAAGAAG 2640 
TCTTTTGGAA CAGGAAAATC ATTCACCACT CACAGGGTCA AATATGAAAT ACAAAACCAC 2700 
GAACCAATCA ACAGAATTTT TATCCTTCCA AGATGCCAGC TCATTGTACA GAAACATTTT 2760 
AGAAAAAGAA AGGGAACTTC AGCAACTGGG AATCACAGAA TACCTAAGGA AAAACATTGC 2820 
TCAGCTCCAG CCTGATATGG AGGCACATTA TCCTGGAGCC CACGAAG AGC TG AAGTTAAT 2880 
GGAAACATTA ATGTACTCAC GTCCAAGGAA GGTATTAGTG GAACAGACAA AAAATGAGTA 2940 
TTTTGAACTT AAAGCTAATT TACATGCTG A ACCTG ACTAT TTAGAAGTCC TGG AGCAGCA 3000 
AACATAGATG GAGAGTTTGA GGGCTTTCGC AGAAATGC'l'G TGATTCTGTT TTAAGTCCAT 3C60 
ACCTTGTAAA TAAOTGCCTT ACGTGAGTGT GTCATCAATC AGAACCTAAG CACAGCAGTA • 120 
AACTATGGGG AAAAAAAAAG AAGAAGAAAA GAAACTCAGG GATCACTGGG AGAAGCCATG 3180 
GCATTATCTT CAGGCAATTT AGTCTGTCCC AAATAAAATC AATCCTTGCA TGTAAATC 



Seq ID NO: 8 Protein secvuence: 
Protein Accession it: none found 

1 11 21 31 41 51 
I I I I I j 

MKLWIHLFYS SLLACiSLHS QTPVLSSRGS CDSLCNCEEK DGTMLMCEA KGKMVSEIS 60 
VPFSRPFQLS LLNNGLTMLH TNDFSGLTNA IS1HLGFNNI ADIEIGAFNG LGLLKQLHIN 1 20 
HNSLE1LKED TFHGLENLEF LQADNNFITV IEPSAFSKLN RLKVLILNDN A1E5LPPNIF 180 
RFVPLTHLDL RGNQLQTLPY VGFLEHIGRJ LDLQLEDNKW ACNCDLLQLKL TWLENMPPQS 240 
nGDVVCNSP PFFK.GSILSR LKKES1CPTP PVYEEF1EDPS GSLHLAATSS INDSRMSTKT 300 
TSILKLPTKA PGLIPY1TKP STQLPGPYCP IPCNCKVLSP SGLLIHCQER NIHSLSDLRP 360 
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PPQNPRKL1L AGNIIHSLMK. SDLVEYFTLE MLHLGNNPJE VLEEGSFMNL TRLQKLYLNG 420 
NHLTKLSKGM FLGLHNLEYL YLEYNAKEI LPGTFNPMPK LKVLYLNNNL LQVLPPK1FS 480 
GVPLTKVNLK TNQFTHLPVS N1LDDLDLLT Q1DLEDNPWD CSCDLVGLQQ WIQKLSKNTV 540 
TDDILCTSPGHLDKKELKALNSEILCPaLV>MPSMPTQTSYLMVTTPATT™TADTILRS 600 
5 LTDAVPLSVL ILGLLIMFIT IVFCAAGIVV LVLHKRRRYK KKQVDEQMRD NSPVHLQYSM 660 
YGHKTTHHTT ERPSASLYEQ HMVSPM VHVY RSPSFGPKHL EEEEERNEKE GSD AKHLQRS 720 
IXEQENHSPL TOSNMKYKTT NQSTEFLSFQ DASSLYRNIL EKERELQQLG ITEYLRKNIA 780 
QLQPDMEAHY PGAKEELKXM ETLM YSRPRK VLVEQTKNEY FELKANLHAE PDYLEVLEQQ 840 

io T 

Seq ID NO: 9 DNA sequence 

Nucleic Acid Accession #: NM_003474 

Coding sequence: 307-3036 (underlined sequences correspond to start and stop codons) 



1 11 il 31 41 51 

1)111! 

CACTAACGCT CTTCCTAGTC CCCGGGCCAA CTCGGACAGT TTGCTCATTT ATTGCAACGG 60 
TCAAGGCTGG CTTGTGCCAG AACGGCGCGC GCGCGACGCA CGCACACACA CGGGGGGAAA 120 

2 0 CTTTTTTAAA AATGAAAGGC TAGAAG AGCT CAGCGGCGGC GCGGGCCGTG CGCG AGGGCT 180 

CCGGAGCTGA CTCGCCGAGG CAGGAAATCC CTCCGGTCGC GACGCCCGGC CCCGCTCGGC 240 
■ GCCCGCGTGG GATGGTGCAG CGCTCGCCGC CGGGCCCGAG AGCTGCTGCA CTGAAGGCCG 300 
GCGACGATGG CAGCGCGCCC GCTGCCCGTG TCCCCCGCCC GCGCCCTCCT GCTCGCCCTG 360 
GCCGGTGCTC TGCTCGCGCC CTGCGAGGCC CGAGGGGTGA GCTTATGGAA CGAAGGAAGA 420 

25 GCTGATGAAG TTGTCAGTGC CTCTGTTCGG AGTGGGGACC TCTGGATCCC AGTGAAGAGC 480 
TTCGACTCCA AGAATCATCC AGAAGTGCTG AATATTCGAC TACAACGGGA AAGCAAAGAA 540 
CTGATCATAA ATCTGGA A AG AAATGAAGGT Ci'CATTGCC A GCAGTTTCAC GG AAACCCAC 600 
7ATCTGCAAG ACGGTACTGA TGTCTCCCTC GGTCGAAATT ACACGGTAAT TCTGGGTCAC 600 
TGTTACTACC ATGGACATGT ACGGGGATAT TCTGATTCAG CAGTCAGTCT CAGCACGTGT 720 

30 TCTGGTCTCA GGGGACTTAT TGTGTTTGAA AATGAAAGCT ATGTCTTAGA ACCAATGAAA 780 
AGTGCAACCA ACAGATACAA ACTCTTCCCA GCGAAGAAGC TGAAAAGCGT GCGGGGATCA 840 
TGTGGATCAC ATCACAACAC ACCAAACCTC GCTGCAAAGA ATGTGTTTCC ACCACCCTCT 900 
CAGACATGGG CAAGAAGGCA TAAAAGAGAG ACCCTCAAGG CAACTAAGTA TGTGGAGCTG 960 
GTGATCGTGG CAGACAACCG AGAGTTTCAG AGGCAAGGAA AAGATCTGGA AAAAGTTAAG 1020 

3 5 CAGCGATTAA TAGAGATTGC TAATCACGTT GACAAGTTTT ACAGACCACT GAACATTCGG 1080 

ATCGTGTTGG TAGGCGTGGA AGTGTGGAAT GACATOGACA AATGCTCTGT AAGTCAGGAC 1140 
CCATTCACCA GCCTCCATGA ATTTCTGGAC TGGAGGAAG A TGAAGCTTCT ACCTCGCAAA 1200 
TCCCATGACA ATGCGCAGCT TGTCAGTGGG GTTTATTTCC AAGGGACCAC CATCGGCATG 1260 
GCCCCAATCA TGAGCATGTG CACGGCAGAC CAGTCTGGGG GAATTGTCAT GGACCATTCA 1320 

40 GACAATCCCC TTGGTGCAGC CGTGACCCTG GCACATG AGC TGGGCCACA A TTTCGGGATG 1 3 SO 
AATCATGACA CACTGGACAG GGGCTGTAGC TGTCAAATGG CGGTTGAGAA AGGAGGCTGC 1440 
ATCATGAACG CTTCCACCGG GTACCCATTT CCCATGGTGT TCAGCAGTTG CAGCAGGAAG 1500 
GACTTGGAGA CCAGCCTGGA GAAAGGAATG GGGGTGTGCC TGTTTAACCT GCCGGAAGTC 1560 
AGGGAGTCTT TCGGGGGCCA GAAGTGTGGG AACAGATTTG TGGAAGAAGG AGAGGAGTGT ! 620 

45 GACTGTGGGG AGCCAGAGGA ATGTATGAAT CGC'i'GCTGCA ATCiCCACCAC CTGTACCCTG 1680 
AAGCCGGACG CTGTGTGCGC ACATGGGCTG TGCTGTGAAG ACTGCCAGCT GAAGCCTGCA 1740 
GGAACAGCGT GCAGGGACTC CAGCAACTCC TGTGACCTCC CAGAGTTCTG CACAGGGGCC 1SD0 
AGCCCTCACT GCCCAGCCAA CGTGTACCTG CACGATGGGC ACTCATGTCA GGATGTGGAC I860 
GGCTACTGCT ACAATGGCAT CTGCCAGACT CACGAGCAGC AGTGTGTCAC ACTCTGGGGA 1920 

50 CCAGGTGCTA AACCTGCCCC TGGGATCTGC TTTGAGAGAG TCAATTCTGC AGGTGATCCT 1330 
TATGGCAACT GTGGCAAAGT CTCGAAGAGT TCCTTTGCCA AATGCGAGAT GAGAGATGCT 2040 
AAATGTGGAA AAATCCAGTG TCAAGGAGGT GCCAGCCGGC CAGTCATTGG TACCAATGCC 2100 
GTTTCCATAG AAACAAACAT CCCCCTGCAG CAAGGAGGCC GGATTCTGTG CCGGGGGACC 2160 
CACGTGTACT TGGGCGATGA CATGCCGGAC CCAGGGCTTG TGCTTGCAGG CACAAAGTGT 2220 

5 5 GCAGATGG AA A AATCTGCCT GAATCGTCAA TGTCAAA ATA TTAGTGTCTT TGGGOTTCAC 2280 
GAGTGTGCAA TGCAGTGCCA CGGCAGAGGG GTGTGCAACA ACAGGAAGAA CTGCCACTQC 2340 
GAGGCCCACT GGGCACCTCC CTTCTGTGAC AAGTTTGGCT TTGGAGGAAG CACAGACAGC 2400 
GGCCCCATCG GGCAAGCAGA TAACCAAGGT TTAACCATAG GAATTCTGGT GACCATCCTG 2460 
TGTCTTCTTG CTGCCGGATT TGTGGTTTAT CTCAAAAGGA AGACCTTGAT ACGACTGCTG 2520 

60 TTTACAAATA AGAAGACCAC CATTGAAAAA CTAAGGTGTG TGCGCCCTTC CCGGCCACCC 2S80 
nGTOGCTTCC AACCCTOTCAGGCTCACCTC GGCCACCTTtl UAAAAGGO t t. GAA( I 

CCGCCAGATT CCTACCCACC GAAGGACAAT CCCAGGAGAT TGCTGCAGTG TCAGAATGTT 2700 
GACATCAGCA G ACCCCTCAA CGGCCTGAAT GTCCCTCAGC CCCAGTCAAC TCAGCGAGTG 2760 
CTTCCTCCCC TCCACCGGGC CCCACGTGCA CCTAGCGTCC CTGCCAGACC CCTGCCAGCC 2S20 

65 AAGCCTGC AC TTAGGCAGGC CCAGGGG ACC TGTAAGCCAA ACCCCCCTCA G AAGCCTCTG 28B0 
CCTGCAGATC CTCTGGCCAG AACAACTCGG CTCACTCATG CCTTGGCCAG GACCCCAGGA 2940 
CAATGGGAGA CTGGGCTCCG CCTGGCACCC CTCAGACCTG CTCCACAATA TCCACACCAA 3000 
GTGCCCAGAT CCACCCACAC CGCCTATATT AAGTGA.GAAG CCG ACACCTT TTTTCAACAG 3060 
TGAAGACAGA AGTTTGCACT ATCTTTCAGC TCCAGTTGGA GTTTTTTGTA CCAACTTTTA 3120 

70 GG ATTTTTTT TAATGTTTAA AACATCATTA CTATAAGA AC TTTGAGCTAC TGCCGTCAGT 31 80 
GCTGTGCTGT GCTATGGTGC TCTGTCTACT TGCACAGGTA CTTGTAAATT ATTAATTTAT 3240 
GCAGAATGTT GATTACAG7G CAGTGCGCTG TAGTAGGCAT TTTTACCATC ACTGAGTTTT 3300 
CCATGGCAGG AAGGCTTGTT GTGCTTTTAG TATTTTAGTG AACTTGAAAT ATCCTGCTTG 3360 
ATGGGATTCT GGACAGGATG TGTTTGCTTT CTGATCAAGG CCTTATTGGA AAGCAGTCCC 3420 

75 CCAACTACCC CCAGCTGTGC TTATGGTACC AGATGCAGCT CAAGAGATCC CAAGTAGAAT 34S0 
CTCAGTTGAT TTTCTGGATT CCCCATCTCA GGCCAGAGCC AAGGGGCTTC AGGTCCAGGC 3540 
TGTGTTTGGC TTTCAGGGAG GCCCTGTGCC CCTTGACAAC TGGCAGGCAG GCTCCCAGGG 3600 
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ACACCTGGGA GAAATCTGGC TTCTGGCCAC GAAGCTTTGG TGAGAACCTG GGTTGCAGAC 3660 
AGGAATCTTA AGGTGTAGCC ACACCAGGAT AGAGACTGGA ACACTAGACA AGCCAGAACT 3720 
TGACCCTGAG CTGACCAGCC GTGAGCATGT TTGGAAGGGG TCTGTAGTGT CACTCAAGGC 3780 
GGTGCTTGAT AGAAATGCCA AGCACTTCTT TTTCTCGCTG TCCTTTCTAG AGCACTGCCA 3840 
5 CCAGTAGGTT ATTTAGCTTG GGAAAGGTGG TGTTTCTGTA AGAAACCTAC TGCCCAGGCA 3900 
CTGCAAACCG CCACCTCCCT ATACTGCTTG GAGCTGAGCA AATCACCACA AACTGTAATA 3960 
CAATGATCCT GTATTCAGAC AGATGAGGAC TTTCCATGGG ACCACAACTA TTTTCAGATG 4020 
TGAACCATTA ACCAGATCTA GTCAATCAAG TCTGTTTACT GCAAGGTTCA ACTTATTAAC 4080 
AATTAGGCAG ACTCTTTATG CTTGCAAAAA CTACAACCAA TGGAATGTGA TaTrCATGGG 4 140 

1 0 TATAGTTCAT GTCTGCTATC ATTATTCGTA G ATATTGG AC AAAGAACCTT CTCTATGGGG 4200 
CATCCTCTCT TTCCAACTTO GCTGCAGGAA TCTTTAAAAG ATGCTTTTAA CAGAGTCTGA 4260 
ACCTATTTCT TAAACACTTG CAACCTACCT GTTGAGCATC ACAGAATGTG ATAAGGAAAT 432D 
CAACTTGCTT ATCAACTTGC TAAATATTAT GAGATGTGGC TTGGGCAGCA TCCCCTTGAA 4380 
CTCTTCACTC TTCAAATGCC TGACTAGGGA GCCATGTTTC ACAAGGTCTT TAAAGTGACT 4440 

1 5 AATGGCATGA GAAATACAAA AATACTCAGA TAAGGTAAAA TGCCATGATG CCTCTGTCTT 4500 
CTGGACTGGT TTTCACATTA G AAGACAATT GAC AACAGTT ACATA ATTCA CTCTGAGTGT 4S60 
TTTATGAGAA AGCCTTCTTT TGGGGTCAAC AGTTTTCCrA TGCTTTGAAA CAGAAAAATA 4620 
TGTACCAAGA ATCTTGGTTT GCCTTCCAG A AAACAAAACT GCATTTCACT TTCCCGGTGT 4680 
TCCCCACTGT ATCTAGGCAA CATAGTATTC ATGACTATGG ATAAACTAAA CACGTGACAC 4740 

20 AAACACACAC AAAAGGGAAC CCAGCTCTAA TACATTCCAA CTCGTATAGC ATGCATCTGT 4800 
TTATTCTATA GTTATTAAGT TCTTTAAAAT GTAAAGCCAT GCTGGAAAAT AATACTGCTG 4860 
AGATACATAC AGAATTACTG TAACTGATTA CACTTGGTAA TTGTACTAAA GCCAAACATA 4920 
TATATACTAT TAAAAAGGTT TACAG AATTT TATGGTGCAT TACGTGGGCA TTGTCTTTTT 4980 
AGATGCCCAA ATCCTTAGAT CTGGCATGTT AGCCCTTCCT CCAATTATAA GAGGATATGA 5040 

25 ACCAAAAAAA AAAAAAAAAA AA 



Set) 115 NO: 1 0 Protein sequence: 
Protein Accession ft NP_003465.2 

30 . 

1 II 21 31 41 Jl 

MAAKPLPVSP ARALLLALAGALLAPCEARG VSLWNEGRAD EWSASVRSG DLW1PVKSFD 60 
SKNHPEVLNI RLQRESKELI 1NLERNEGLI ASSFTBTHYL QDGTDVSLAR NYTVILGHCY 120 

35 YHGHVRGYSD SAVSLSTCSG LRGL1VFENE SYVLEPMKSA TMRYKLFPAK KLKSVRGSCG ISO 
SHHNTPNLAA KNVFPPPSQT WARRHKRETL KATKYVELVI VADNREFQRQ GKDLErCVKQR 240 
LIEIANHVDK FYRPLNIR1V LVGVEVWNDM DKCSVSQDPF TSLHEFLDWR KMKLLPRKSH 300 
DNAQLVSGVY FQGTTIGMAP MSMCTADQS GG1VMDHSDN PLGAAVTLAH ELGHNFGMNH 360 
DTLDRGCSCQ MAVEK.GGCM NASTGYPFPM VFSSCSRKDL ETSLEKGMGV CLFNLPEVRE 420 

40 - ■ SFGGQKCGNR FVEEGEECDC GEPEECMNRC CNATTCTLKP DAVCAHGLCC EDCQLKPAGT 480 
A CRDSSNSCD LPEFCTGASP HCFANVYLHD GHSCQDVDGY CYNGICQTHE QQCVTLWGPG 540 
AKPAPGICFE RVNSAGDPYG NCGKVSKSSF AKCEMRDAKC GKIQCQGGAS RPVIGTNAVS 600 
IETNIPLQQG GRILCRGTHV YLGDDMPDPG LVLAGTSCCAD GK1CLNRQCQ N1SWOVHBC 660 
AMQCI1GRG VC NMJKKKCKCBA HWAPPFCDXF GFGGSTDSOP IRQ A DNQGLT IGILVTUCL 720 

45 I.AAC!F VVYLK RKTURUFT NKKTTIEKLR CVRPSRPPRG FQPCQAHLGH LGKGLMRKPP 780 
DSYPPKDNPR RLLQCQNVDI SRPLNGLNVP QPQSTQRVLP PLHRAPRAPS VPARPLPAKP 840 
ALRQAQGTCK PNPPQKPLPA DPLARTTRLT HALARTPGQW ETGLRLAPLR PAPQYPHQVP S00 
RSTHTAYIK 

50 Seq ID NO; 1 1 DNA sequence 

Nucleic Acid Accessioa ft NM_001 394 

Coding sequence: 400-1 5S4(underliued sequences correspond to start and stop codons) 

55 1 II 21 3t 41 51 

GGCACGAGGG CTGAGCGCCG GAGGAGCGTA GGCAGGGCAG CGCTGGCGCC AGCGGCGACA 60 
GGAGCCGCGC GACCGGCAAA AATACACGGG AGGCCGTCGC CGAAAAGAGT CCGCGGTCCT 120 
CTCTCGTA AA CACACTCTCC TCCACCGGCG CCTCCCCCTC CGCTCTGCGC GCCGCCCGGC 1 80 
TGGGCGCCCG AGGCCGCTCC GACTGCTATG TGACCGCGAG GCTGCGGGAG GAAGGGGACA 240 

60 GGG AAGAAG A GGCTCTCCCG CGGGAGCCCT TGAGGACCAA < iTTTGCGGCC ACTTCTGCAG 300 
GCGTCCCTTC TTAGPTC 1 CO CC'TT TCTG CAGCCT T ilinHlf uO 

r r | lJGTG 11 

CGGGAGATGG ACTGCAGTGT OCTCAAAAGG CTGATGAACC GGGACGAGAA TGGCGGCGGC 480 
GCGGGCGGCA GCGGCAGCCA CGGCACCCTG GGGCTGCCGA GCGGCGGCAA GTGCCTGCTG 540 

65 CTGGACTGCA GACCGTTCCT GGCGCACAGC GCGGGCTACA TCCTAGGTTC GGTCAACGTG 600 
CGCTGTAACA CCATCGTGCG GCGGCGGGCT AAGGG CTCCG TGAGCCTGG A GCAG ATCCTG 660 
CCCGCCGAGG AGGAGGTACG CGCCCGCTTG CGCTCCGGCC TCTACTCGGC GGTCATCGTC 720 
TACGACGAGC GCAGCCCGCG CGCCGAGAGC CTCCGCGAGG ACAGCACCGT GTCGCTGGTG 780 
GTGCAGGCGC TGCGCCGCAA CGCCGAGCGC ACCGACATCT GCCTGCTCAA AGGCGGCTAT 840 

70 G AG AGGTTTT CCTCCGAGTA CCCAG AATTC TGTTCTAAAA CCAAGGCCCT GGCAGCCATC 900 
CCACCCCCGG TTCCCCCCAG CGCCACAGAG CCCTTGGACC TGGGCTGCAG CTCCTOTGGG 960 
ACCCCACTAC ACGACCAGGG GGGTCCTGTG GAGATCCTTC CCTTCCTCTA CCTCGGCAGT 1020 
GCCTACCATG CTGCCCGGAG AGACATGCTG GACGCCCTGG GCATCACGGC TCTGTTGAAT 1080 
GTCTCCTCGG ACTGCCCAAA CCACTTTGAA GGACACTATC AGTACAAGTG CATCCCAGTG 1140 

75 GAAGATAACC ACAAGGCCGA CATCAGCTCC TGGTTCATGG AAGCCATAGA GTACATCGAT 1200 
GCCGTGAAGG ACTGCCGTGG GCGCGTGCTG GTGCACTGCC AGGCGGGCAT CTCGCGGTCG 1260 
GCCACCATCT GCCTGGCCTA CCTGATGATG AAGAAACGGG TGAGGCTGGA GGAGGCCTTC 1320 
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G AGTTCGTTA AGCAGCGCCG CAGC ATCATC TCGCCCAACT TCAGCTTCAT GGGGCAGCTG 1 380 
CTGCAGTTCG AGTCCCAGGT GCTGGCCACG TCCTGTGCTG CGGAGGCTGC TAGCCCCTCG 1440 
GGACCCCTGC GGGAGCGGGG CAAGACCCCC GCCACCCCCA CCTCGCAGTT CGTCTTCAGC 1500 
TTTCCGGTCT CCGTGGGCGT GCACTCGGCC CCCAGCAGCC TGCCCTACCT GCACAGCCCC 1560 
5 ATCACCACCT CTCCCAGCTG TTAGAGCCGC CCTGGGGGCC CCAGAACCAG AGCTGGCTCC 1620 

CAGCAAGGGT AGGACGGGCC GCATGCGGCA GAAAGTTGGG ACTGAGCAGC TGGGAGCAGG 1680 
CGACCGAGCT CCTTCCCCAT CATTTCTCCT TGGCCAACGA CGAGGCCAGC CAGAATGGCA 1740 
ATAAGGACTC CGAATACATA ATAAAAGCAA ACAGAACACT CCAACTTAGA GCAATAACCG 1800 
GTGCCGCAGC AGCCAGGGAA G ACCTTGGTT TGGTTTATGT GTCAGTTTCA CTTTTCCG AT 1860 

10 AGAAATTTCT TACCTCATTT TTTTAAGCAG TAAGGCTTGA AGTGATGAAA CCCACAGATC 1920 
i \TG TGCGCAA TT <■ \ M JCJACH 5GGAG GGATGAG 1 

AAGACAAGTT TCCCAGAAGT GCCTGGTTCT GGGTACITGT CCCTTTGTTG TCGTTG'ITGT 2040 
AGTTAAAGGA ATTTCATTTT TAAAAGAAAT CTTCGAAGGT GTGGTTTTCA TTTCTCAGTC 2100 
ACCAACAGAT GAATAATTAT GCTTAATAAT AAAGTATTTA TTAAGACTTT CTTCAGAGTA 2160 

1 5 TGAAAGTACA AAAAGTCTAG TTACAGTGGA TTTAGAATAT ATTTATGTTG ATGTCAAACA 2220 
GCTGAGCACC GTAGCATGCA GATGTCAAGG CAGTTAGGAA GTAAATGGTG TCTTGTAGAT 2280 
ATGTGCAAGG TAGCATGATG AGCAACTTGA GTTTGTTGCC ACTGAGAAGC AGGCGGGTTG 2340 
GGTGGGAGGA GGAAGAAAGG GAAGAATTAG GTTTGAATTG CTTTTTAAAA AAAAAAGAAA 2400 
AGAAAAAGAC AGCATCTCAC TATGTTGCCA AGGCTCATCT TGAGAAGCAG GCGGGTTGGG 2460 

20 TGGGAGGAGG AAGAAAGGGA AGAATTAGGT TTGAATTGCT TTTTTAAAAA AAAA 

Seq ID NO: 12 Protein sequence: 
Protein Accession «: NPJQ1385 

25 1 11 21 31 41 51 

MVTMJJBLREM DCSVLKRLMN RDENG GGAGG SGSHGTLGLP SGGKCILLDC KFFLAHSAGY 60 
lLCiSVNVRCN TWRRRAKGS VSLEQ1LPAE EEVRARLRSG I.YSA i i'DHRSPI ESLRI [20 
DSTVSLWQA LRRNAER TIM CLLK.GGYERF SSEYPEFCSIi TKALAAIPPP VPPSATEPI.D 1 SO 
30 LGCSSCGTPL HDQGGF VEIL PFLYLGSAYH AARRDMLDAL GITALLNVSS DCPNHFEGHY 240 
QYKCIPVEDN HKADISSWFM EA1EYIDAVK DCRGRVLVHC QAGISRSATI CLAYLMMKKR 300 
VRLEEAFEFV KQRRSHSPN FSFMGQLLQF ESQVLATSCA AEAASPSGPL RERGK.TPATP 360 
TSQFVFSFPV SVGVHSAPSS LPYLHSPITT SPSC 

35 Seq ID NO: 13 DNA sequence 

Nucleic Acid Accession ti: none found 

Coding sequence: 6S-340(iinderlined sequences correspond to start and stop codons) 



40 1 11 21 31 41 SI 

AGCGCCTTGC CTTCTCITAGGCTTTG AAGC ATTTTTGTCT GTGCTCCCTG ATCTTCAOGT 60 
CACCACCATG AAGTTCTTAG CAGTCCTGGT ACTCTTGGGA GTTTCCATCT TTCTGGTCTC 120 
TGCCCAUAA T CCGACAACAG CTGCTCCAGC TGACACGTAT CCAGCTACTG GTCCTGCTG A 1 80 

45 rGA'i'fiA >."4 l CCTJATGCTG AAACCACTGC TGCTGCAACC ACTGCGACCA r I i' IXiCTCC 240 
TACCACTGCA ACCACCGCTG CTTCTACCAC TGCTCGTAAA GACATTCCAG TTTTACCCAA 300 
ATGGGTTGGG G ATCTCCCGA ATGGTAG AGT GTGTCCCTGA GATGGA ATCA GCTTGAGTCT 360 
TCTGCAATTG GGTCACAACT ATTCATGCTT CCTGTGATTT CATCCA ACTA CTTACCTTGC 420 
CTACGATATC CCCTTTATCT CTAATCAGTT TATTTTCTTT CAAATAAAAA ATAACTATGA 480 

50 GCGAGCTAAC AT 



III! 



Seq ID NO: 15 DNA sequence 

Nucleic Acid Accession *\ NM_016640,2 

Coding sequence: 39-1 358(underlined sequences correspond to start and stop codons) 

65 

1 11 21 31 41 51 

GCTTAAGTTG ACCTCTGGGT CCGG AATCGC GGGCAA AGAT GGCGGCGGCC AGGTGTTGGA 60 
70 GGCCTTTGCT ACGCGGTCCG AGGCTTTCAT TGCACACCGC GGCTAATGCC GCCGCCACGG 120 
CTACAGAAAC GACCTCCCAA GACGTCGCGG CG ACCCCCGT CGCGCGGTAC CCGCCGATTG 1 80 
TGGCCTCCAT GACAGCCG AC AGCAAAGCTG CACGGCTGCG GCGGATCGAG CGCTGGCAGG 240 
CGACGGTGCA CGCTGCGGAG TCGGTAGACG AGAAGCTGCG AATCCTCACC AAGATGCAGT 300 
TTATG A ACTA CATGGTTTAC CCGCAGACCT TCGCACTGAA TGCCGACCGC TGGTACCAGT 360 
75 ACTTCACCAA GACCGTGTTC CTGTCGGGTC TGCCGCCGCC CCCAGCGGAG CCCGAGCCCG 420 
AGCCCGAACC CGAACCTGAA CCTGCGCTGG ACCTCGCGGC GCTGCGTGCG GTCGCCTGCG 480 
ACTGCCTGCT GCAGGAGCAC TTCTACCTGC GGCGCAGGCG GCGCGTGCAC CGTTACGAGG 540 
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AGAGCGAGGT CATATCTTTG CCCTTCCTGO ATCAGCTGGT GTCAACCCTC GTGGGCCTCC 600 
TCAGCCCACA CAACCCOGCC CTGGCCGCTG CCGCCCTCGA TTATAGATOC CCAGTTCATT 660 
TTT ACTGGGT GCGTGGTGAA GAAATTATTC CTCGTGGTCA TCG AAGAGOT CGAATTGATG 720 
ACTTGCGATA CCAGATAGAT GATAAACCAA ACAACCAGAT TCGAATATCC AAGCAACTCG 7B0 
5 CAGAGTTTGT GCCATTGGAT TATTCTGTTC CTATAGAAAT CCCCACTATA AAATGTAAAC 840 
CAGACAAACT TCCATTATTC AAACGGCAGT ATGAAAACCA CATATTTGTT GGCTCAAAAA 900 
CTGCAGATCC TTGCTGTTAC GGTCACACCC AGTTTCATCT GTTACCTGAC AAATTAAGAA 960 
GGG AAAGQCT TTTG AGACAA AACTGTGCTG ATCAGATAGA AGTTGTTTTT AGAGCTAATG 1 020 
CTATTGCAAG CCTTTTTGGT TGGACTGGAG CACAAGCTAT GTATCAAGGA TTCTGGAGTG 1OS0 

1 0 AAGCAGATGT TACTCGACCT TTTGTCTCCC AGOCTGTGAT CACAGATGGA AAATACTTTT 1 140 
CCTTTTTCTG CTACCAGCTA AATACTTTGCJ CACTGACTAC ACAAGCTGAT CAAAATAACC 1200 
CTCGTAAAAA TATATG'!TGG GGTACACAAA GTAAGCCTCT TTATGAAACA ATTGAGGATA 1 2 50 
ATGATGTGAA AGGTTTTAAT GATGATGTTC TACTTCAGAT AGTTCACTTT CTACTQAATA 1320 
GACCAAAAGA AGAAAAATCA CAGCTGTTGG AAAACTGAAA AAGCATATTT GATTGAGAAC 1380 

15 TGTGGGAATA TTTAAATTTT ACTGAAGG AA CAATAATGAT GAGATTTGTA ACTGTCAACT 1440 
ATTAAATACA TTGATTTTTG AGACAAATAT TTCTTATGTC AACCTGTTAT TAGATCTCTT 1500 
ACTCTGCTCA AATTCATCAC TGAAAGATTT AATTTTAGTT ACCnTTGTT GATTTAAAAA 1560 
TAATTGCATT TGTATATTGC TAACTGATAA GACAAATTG A GTTATTGAGC TATTAAATGC 1 620 
ACATTTTAAT ATAAATGCAG AAATCCCAAA TAAAATGCTA ACATACTGAA TTCAGTAATT 1680 

20 AAAAGAACCC ACTGC 



Seq ID NO: 16 Protein seauence: 
^ Protein Accession?/: NP„057724.1 

1 11 2! 31 41 51 
1 I I I I I 

MAAARCWRPL LRGPRLSLHT AANAAATATE TTSQDVAATP VARYPP1VAS MTADSKAARL 60 
RRIERWQATV HAAESVDEKL RILTKMQFMK YMVYPQTFA1. NAPP ^ « 1 I YF .jVrLSGLPP 120 

30 PPAEPEPEPE PEPEPALDLA ALRAVACDCL LQEHFYLRRR RRVHRYEESE VISLPFLDQL 180 

VSTLVGLLSP HNPALAA AAL DYRCPVHFYW VRGEEIIFRG HRRGRIDDLR YQ1DDKPNNQ 240 
IRISKQLAEF VPLDYSVPIE IFTTKCKPDK LPLFKRQYEN H1FVGSKTAD PCCYGHTQFH 300 
LLPDKLRRER LLRQNCADQI EVVFRANAtA SLFAWTGAQA MYQGFWSEAD VTRPFVSQAV 360 
. ITDGKYFSFF CYQLNTLALT TQADQNNPRK NICWGTQSKP LYETIEDNDV KGFNDDVIXQ 420 

3 5 1VHFLLNRPK EEKSQLLEN 

Seq ID NO: 17 DNA sequence 

Nucleic Acid Accession #: NMJK5059.1 

Coding sequence: 3-2 1 50 (underlined sequences correspond to start and stop codons) 



I II 21 31 41 51 
I I 1 I 1 I 

GCATGAGCCT GGACTGCACC AGCCATATCG CGCTGGG7G0 CGC'ITCGCCA 0COCCCGAGG CO 

45 AAACTTACGA TCATCTTTCG GAAGTCCCGG TCACGCGGGA GCAG'ITAAAC CACTATCGGA 120 
ATGTGGCTC A A A ATGCTCG A AGTG A ACTTG CAGC AACTTT G GTCAAATTT G A ATGTGCTC 1 80 
AGTCTGAGCT TCAAGACCTC CGATCCAAGA TGCTTTCTAA AGAAGTCTCC TGTCAAGAAC 240 
TGAAAGCTGA AATGGAGAGC TACAAGGAAA ACAATGCCAG AAAATCATCT CTCCTTACCT 300 
CTTTGAGAGA CAGAGTTCAG GAACTAGAAG AAGAATCAGC AGCACTTTCC ACTTCTAAAA 360 

5 0 TCAGA ACAGA AATCACAGCT CACGCTCCAA TCAAGGAGAA CCAGGAATTA AAGAAGAAAG 42D 
TTGTAGAOTT AAATGAAAAA TTACAAAAGT GTTCAAAAGA AAATGAGGAG AATAAGAAAC 480 
. AAGTTTCAAA GAATTGCAGG AAACATGAGG AATTTCTGAC TCAACTGCGT GACTGCTTGG 540 
ATCCAGATGA GAGGAATGAC AAGGCATCAG ATGAAGATTT AATTTTAAAG CTTAGAGACC 600 
TGCGCAAAGA AAATGAATTC GTGAAAGGAC AAATTGTTAT TCTTGAAGAG ACTATAAATG 660 

55 TCCATGAGAT GGAAGCAAAA GCTAGCAGAG AAACGATCAT GAGGCTGGCT TCAGAAGTCA 720 
ACAGAGAGCA GAAAAAAGCT GCCTCCTGTA CTGAAGAGAA AGAGAAGCTG AACCAGGACC 780 
TGCTCAGTGC TGTAGAAGCA AAAGAAGCTC TTGAAAGGGA AGTTAAGATC TTCCAAGAAA 840 
GGCTGCTTGC TGGCCAGCAG GTCTGGGATG CCTCAAAGCA GGAAGTGAGC CTCCTGAAGA S00 
A A AGCTCTTC TG AGTTGGAG AAG AGTTTG A AGGCCAGTCA GGATGCAGTC ACA ACCTC AC MO 

60 AAAGCCAGTA CTCCTCATTT AGGGAGAAAA TCGCAGCCCT CCTTAGGGGC AGATTGAGCA 1020 
TGACTGGGTC CACTGAGGAC ACCATTTTGG AGAAGATl'CG AGAAATGGAC AGCCGGGAAG 1080 
AAAGCAGCiGA CCGGATGGTC TCCCAGCTTG AAGCCCAAAT ATCTGAGCTT GTTGAACAGT 1 140 
TGGGAAAGGA GTCTGGGTTT CACCAGAAAG CTCTCCAGAG GGCCCAGAAA GCAGAGAATA 1200 
TGTTGGAGAC TCTTCAGGGT CAGCTGACAC ACCTGGAGGC AGAGCTGGTT TCTGGAGGTG 1260 

65 TTTTGCG AGA CAACTTGAAT TTTGAG AAAC AAAAATATCT TAAATTTCTG GATCACCTTT 1 320 
CTCAGAAAAT GAAGTTGGAC CAGATGGCTG CCGAACTTGG CTTTGACACG CGGCTGGACG 1380 
TGGTTTTAGC TCGAACAGAG CAGCTGGTTC GTCTTGAGAG CAATGCAGTC ATTGAGAACA 1440 
AG ACCATTGC CCACAATTTG CAG AGAAAGC TAA AGACAC A GAAAGAGAG A CTGGAGAGCA 1 500 
AAGAATTACA CATGAGCCTC CTCCGGCAGA AAATAGCCCA GCTGGAGGAG GAGAAGCAGG 1560 

70 CACGCACGGC CTTGGTGGTT GAGAGGGACA ACGCGCATCT TACCATCAGG AACTTGC AG A 1 620 
AGAAGGTGGA GAGGCTGCAG AAAGAGCTQA ACACGTGTCG AGACTTGCAC ACCGAGCTCA 1680 
AAGCCAAACT GGCCGACACC AATGAACTGA AGATTAAAAC TTTGGAACAG ACTAAAGCCA 1740 
TTGAAGATCT AAACAAATCC AGAGACCAAC TGGAGAAGAT GAAGGAGAAA GCTGAGAAAA 1800 
AGCTCATGTC TGTCAAGTCA GAACTGGATA CCACAGAACA TGAGGCTAAG GAGAATAAAG I860 

75 AAAGGGCCAG AAACATGATA GAAGTGGTAA CCAGTGAAAT GAAGACACTA AAAAAATCTC 1920 
TGGAAGAAGC AGAAAAGAGA GAAAAGCAOC TGGCAGACTT CAGGGAGGTG GTGTCGCAGA 1980 
TGCTAGGCTT GAACGTGACC AGCCTTGCTC TTCCTGATTA TGAAATCATC AAGTGTCTTG 2040 
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AAAGATTGGT CCATTCACAT CAGCATCACT TTGTTACCTG TGCCTGCCTC AAAGATGTOA 2100 
CTACTGGGCA AGAGAGGCAC CCACAAGGCC ATTTACAGCT TCTTCATJGA ACACTGTATC 2160 
TCTTGAGAGA GGTGGCCATA AGACATGGCA CACAATTCCC AATTTCACAA ATTCCTCATG 2220 
TCTTTGAGAT TTG ATCAGTT TGTG AATATT TT ATGCTTTG ATGATATAGT GAGAATGCAT 22SO 
5 CACTTGCAAA AACGATCTCA AAAGTGTCAG CCTTAG ATAA ACGTCAGCAT TAAAAAACGC 2340 
CAAAAAAAAA AAAAAAAAGC ATTTTAGGAT CCAGAAGAAT TCCACCAGAT TGCATGAGTT 2400 
AGATTGGGAA ATGGGAGTGG GAGATAATAT TGGGAGGTAT CTATTTTAAG TCAGGGGCTT 2460 
TACTAGCCG A TTTAGTTCTC ACAATAACCA TGTGGAG A AG CTGTG ACATT TTTAATTTAC 2520 
AACCTTTCTG GGGCTCAGAC ATAAAGTTAC CTATCCAAGG TTGCAGTTGG GTAGTGGTGG 25BO 

1 0 GACCAGGATG GACAAC J CAT TGGCCCTGCC TCAAAAGCCA TACCTCTTCT CCTGCTA'J GC 26-10 
AG AATCTGTT TCTCCTCAAT CTCTGTGATG CTGGTGGGAA 'lTQlTTGCAT AGAGGAAGGA 2700 
CAATAACCCT GCCATCGTGA GTTAATGTCC GGGCTGGTCA CAGTGGTTCA TGCCTGTAAT 2760 
CCCAGCACTT TGGGAGTCCA AGGCAGGCAT ATCAITTGAG GTCAGGAGTT TAAGACCAGC 2820 
CTGGCTAACA TAGTGAGACC CTGTTTCTAC TAAAAATACA AAAATAAGCC AGGTGTGGTG 2880 

1 5 GTGCATGACT G7/AATCCCAA CTACTCAGCA GGAGAAGCAC TTGAACCCAG GAGACGGAGG 2940 
CGGCAGTGAG CCAAGATTGT GCCACTGCAC TCCAGCCTGG GCGACAGAGT GAAACTACAT 3000 
CTCAGAAAAA AAAAAAAAAA AAAAAAAAAA AAGTTAATGT CCAAAAATGA CAGATTTACA 3060 
AGTGTAAGCT ATATGATTTC TTCAAAAAGC AAAAGCAATA TACCTAATTC ATTTGGATCA 3120 
AACTTACATA GGTCTCAGGT CCTGTAAG AA ACTTGCCTGT TCTAACTGTT GCTACCAG AT 3 1 BO 

20 TATATCTGGT GGTAATTGTT AATGTTTCAG GAGGGCTGGT CTCAGTCCTT TAAAATGGAA 3240 
AGCTTTATTT GGAGCCCAAC CCTATAAGAT GAAGAAATCC TATATAGTCT TATTCACCAA 3300 
TATATCGAAT ACACCCACAG CAATGGTACC TTTTTA AGAT CAGG ATTTTA TTATG AATTC 3360 
CTGTCACTTT CTGTTTTCCA TTTAAATTTC TATTTTACAA ATTTTTCAGG GAATCATATT 3420 
CTTAACTTCA CTGAGAAAAA TGTGTTACTC TTTTGG ACAA TTTATCTTAT TTCTATCATA 3480 

25 TAAGATGTAT TTTTTTATTG TCCTTAAAAG AAGCTCTAGC ATGAAATTAA AGGAAAGGGA 3540 
AAGAAATAGA TCTGGTGCAC CCGAACATTA GGAGAAAATG AAAAATATAC AACCAACCGT 3600 
TCGTGAGTCA TCAAAAAGTC AAAGTCAGCC TGGCCAACAT GGCAAAACTC CGTCTCTGCA 3660 
AAAAATACAA AAATGAGCCCGGTATGTTGG CATATGCCTG [ >1 CCAGCTACTi iAGAG 72C 
GCTGAGACAC GAAAATTGCT 70AACCTQGO AGGCGGAGGT TGCAATfiAGC CGAGATCGCG 3780 

3 0 CTACTGCACT CCAGCCTGGG CAACAG AG AG AGACCTTGTC TCAAAAAACA ACAACAACAA 3840 
AAAGTCAAAG TCATAATAAG CAAATTATTG GCTTCTTTCT TCTAGACYAA AAGAAATTAA 3900 
AGAGATGAAA CAATCAATTG CAAGGGTCAA AACTAGATTG GATCTTGGTT TGAATGAAAA 3960 
AAAGCATAAA ATATTCTTGC AATAATTGTA AAA ATTTGAA TGTGGACTAA GTCCTAGATT 4020 
ATATTAA AAT ATTTTTAATT TTTTAAGCTT G ACA A ATGCA CTGATTOTTA TACTTTAAAT 4080 

3 5 AACTA A AAAT CTGAGA ATCC ACAGTGCTAC AG ACA ATAAA TGATAAAATG GG AAAAAAAA 4 1 40 
AAAAAAAAAA A 



Sea ID NO: 18 Protein sequence: 
40 Protein Accession*: NPJ79335.1 

1 11 21 31 41 51 

MSLDCTSfflA LGA ASPAPEE TYDHLSEVPV TREQLNHYRN VAQNARSELA ATLVKFECAQ 60 
45 SEI-QDUtSKM LSKEVSCOEL KAEMES YKEN NARKSKU.TS LRDRVQELEE ESAALSTSKI 120 
RTE1TAHAAI KENQELKKKV VELNEKLQKC SKENEENKKQ VSKNCRKHEE FLTQLRDCLD 1 80 
PDEKNJDKASD EDLILKLRDL RKENEFVKGQ IVJLEETINV HEMEAKASRE TMRLASEVN 240 
REQKKAASCT EEKEKLNQDL LSAVEAKEAL EREVKIFQER LLAGQQVWDA SKQEVSLLKK 300 
SSSELEKSLK ASQDAVTTSQ SQYSSFREKI AALLRGRLSM TGSTEDTILE KIREMDSRBE 3 60 
50 SRDRMVSQLE AQ1SELVEQL GKESGFHQKA LQRAQKAENM LETLQGQLTH LEAELVSGGV 420 
LRDNliNFEKQ KYLKFLDQLS QKMKLDQMA A ELGFDTRLDV VLARTEQLVR LESNAV1ENK 480 
TTAHNLQRKL KTQKERLESK ELHMSLLRQK IAQLEEEKQA RTALVVERDN AHLTiRNLQK 540 
KVERLQKELN TCRDL.HTELK AKLADTNELK1 IKTLEQTKAI EDLNKSRDQL EKMKEftAEKK 600 
LMSVKSELDT TEHEAKENKE RARNMIEVVT SEMKTLKKSL EEAEKREKQL ADFREVVSQM 660 
55 LGLNVTSLAL PDYEIIKCLE RLVHSHQHHF VTCACLKDVT TGQERHPQGH LQLLH 



Scq ID NO: 1 9 DNA sequence 

Nucleic Acid Accession #: AF071S52, NM0D0662 

60 Coding sequence: 441-13 13 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

CTTTGTATA A GGCTCAGCTA AAAGGG AAAT TG AGTGGGTC AGGTACCACG G ATACTATAC 60 
ACTCTATTGC ATGATTCTCC TGCCTACATC AGAAGACGTT TATAAGCCTA TTTTAAAGGA 120 
TACCAGTTGG AATCTCTCTT TTATTAATCA CCAAGAGAAC CATGAACAAG CTGTTTATCA 1 80 
TTTG ACTCAT CATTTAATCT TG ATTTCCAG CTTCTCACAC TTGAAAG AAG ACATAATACA 240 
TTTCTCACAG GATTCTGGGA CTATTAACTG AACTTATGTG TGTAAAAGGA ATTCATACAA 300 
TGAAAGCACT AGAAATAATT ATTATACTTA TAACCATTGT ATTTTTACAT GTTTAAAATA 360 
TAGCCATAAT TAGCCTACTC AAATCCA AGT GTA AAA GTAA AATGATTTGC TTTCGTTTTG 420 
TTTTCCTTGC TTAGGGGATC ATGGACATTG AAGCATATCT TGAAAGAATT GGCTATAAGA 480 
AGTCTAGGAA CAAATTGGAC TTGGAAACAT TAACTGACAT TCTTCAACAC CAGATCCGAG 54D 
CTGTTCCCTT TGAGAACCTT AACATCCATT GTGGGGATGC CATGGACTTA GGCTTAGAGG 600 
CCATTTTTGA TCAAGTTGTG AGAAGAAATC GGGGTGGATG GTGTCTCCAG GTCAATCATC 660 
TTCTGTACTG GGCTCTGACC ACTATTGGTT TTGAGACCAC GATGTTGGGA GGGTATGTTT 720 
ACAGCACTCC AGCCAAAAAA TACAGCACTG GCATG ATTCA CCTTCTCCTG CAGGTGACCA 780 
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TTGATGGCAG GAACTACATT OTCGATGCTG GGTTTGGACQ CTCATACCAG ATGTGGCAGC B4D 
CTCTGGAGTT AATTTCTGGG AAGGATCAGC CTCAGGTGCC TTGTGTCTTC CGTTTGACGG 900 
AAGAGAATGG ATTCTGGTAT CTAGACCAAA TCAGAAGGGA ACAGTACATT CCAAATGAAG 960 
AATTTCTTCA TTCTGATCTC CTAGAAGACA GCAAATACCG AAAAATCTAC TCCTTTACTC 1020 
5 TTA AGCCTCG AACAATTGAA GATTTTGAGT CTATGAATAC ATACCTGCAG ACATCTCCAT 1 080 
CATCTGTGTT TACTAGTAAA TCATTTTGTT CCTTGCAGAC CCCAGATGGG GTTCACTGTT 1140 
TGGTGGGCTT CACCCTCACC CATAGGAGAT TCAATTATAA GGACAATACA GATCTAATAG 1200 
AGTTCAAG AC TCTGAGTG AG GAAGAAATAG AAAAAGTGCT GAAAAATATA TTTAATATTT 12C0 
CCTTGCAGAG AAAGCTTGTG CCGAAACATG GTGATAGATT TTTTACTATT TAGAATAAGG 1320 

1 0 AGTAAAACAA TCTTGTCTAT TTGTCATCCA GCTCACCAGT TATCAACTGA CGACCTATCA 13S0 
TGTATCTTCT GTACCC IT AC CTTATTTTGA AGAAAATCCf AGACATCAAA TCATTTCAOC 1440 
TATAAAAATG TCATCATATA TAATTAA ACA GCTTTTTAAA GAAACATAAC CACAAACCTT 1 5C0 
TTCAAATAAT AATAATAATA ATAATAATAA ATGTATTTTA AAGATGGCCT GTGGTTATCT 1560 

j ^ TGGAAATTGG TGATTTATGC TAG A AAGCTT TTAATGTTGG TTTATTGTTG AATTC 

Seq ID NO: 20 Protein sequence: 
Protein Accession ft NP_0006S3. 1 



20 1 11 21 31 41 51 

ill!)! 

MD1EAYLERI GYKKSKNKLD LETLTDILQH Q1RAVFFENL NIHCGDAMDL GLEA1FDQV V 60 
RRNRGGWCLQ VNHLLYWALT TIGFETTMLG GYVYSTPAKK YSTGMIHLLL QVTIDGRNYI 1 20 
VDAGFGRSYQ MWQPLELISG KDQPQVPCVF RLTBENGFWY LDQIRREQY1 PNEEFLHSDL ISO 

2 5 LEDSKYRKIY SFTLKPRTIE DFESMNTYLQ TSPSSVFTSK SFCSLQTPDG VHCLVGFTLT 240 

HRRFNYKDNT DLIEFKTLSE EEIEKVLKNI FNISLQRKLV PKHGDRFFTI 

Seq ID NO: 21 DNA sequence 

Nucleic Acid Accession ft NM_003714 

3 0 Coding sequence: 123-103 1 (underlined sequences correspond to start and stop codons} 



1 11 21 3! 41 51 

3 5 CGGCACG AGC AAAAAGGAAG AGTGGGAGGA GGAGGGG AAG CGGCG AAGGA GGAAG AGGAG 60 
GAGGAGGAAG AGGGGAGCAC AAAGGATCCA GGTCTCCCGA CGGGAGGTTA ATACCAAGAA 120 
CCATGTGTGC CGAGCGGCTG GGCCAGTTCA TGACCCTGGC TTTGGTGTTG GCCACCTTTG ISO 
ACCCGGCGCG GGGGACCGAC GCCACCAACC CACCCGAGGG TCCCCAAGAC AGGAGCTCCC 240 
AGCAGAAAGG CCGCCTGTCC CTGCAGAATA CAGCGGAGAT CCAGCAGTGT TTGGTCAACG 300 

40 CTGGCGATGT GGGGTGTGGC GTGTTTGAAT GTTTCGAGAA CAACTCTTGT GAGATTCGGG 360 
GCTTACATGG GATTTGCATG ACTTTTCTGC ACAACGCTGG AAAATTTGAT GCCCAGGGCA 420 
AGTCATTCAT CAAAGACGCC TTGAAATGTA AGGCCCACGC TCTGCGGCAG AGGTTCGGCT 480 
GCATAAGCCG GAAGTGCCCG GCCATCAGGG AAATGGTGTC CCAGTTGCAG CGGGAATGCT 540 
ACCTCAAGCA CGACCTGTGC GCGGCTGCCC AGGAGAACAC CCGGGTGATA GTGGAGATGA 600 

45 TCCATTTCAA OGAC iTGCTG CTGCACGAAC CCTACGTGGA CCTCGTGAAC ■ TCCTGCTGA 660 

CCTGTGGGGA GGAGGTGAAG GAGGCCATCA CCCACAGCGT GCAGGTTCAG TGTUAGCAG A 720 
ACTGGGGAAG CCTGTGCTCC ATCTTGAGCT TCTGCACCTC GGCCATCCAG AAGCCTCCCA 780 
CGGCGCCCCC CGAGCGCCAG CCCCAGGTGG ACAGAACCAA GCTCTCCAGG GCCCACCACG 840 
GGGAAGCAGG ACATCACCTC CCAGAGCCCA GCAGTAGGGA GACTGGCCGA GGTGCCAAGG 900 

50 GrGAGCGAGG TAGCAAGAGC CACCC AAACG CCCATGCCCG AGGCAG AGTC GGGGGCCTTG 960 
GGGCTCAGGG ACCTTCCGGA AGCAGCGAGT GGGAAGACGA ACAGTCTGAG TATTCTGATA 1020 
TGCGGAGGTGAAATGAAAGG CCTGGCCACG AAATCTl'TCC TCCACGCCGT CCATTTTCTT 1080 
ATCTATGGAC ATTCCAAAAC ATTTACCATT AGAGAGGGGG GATGTCACAC GCAGGATTCT 1 140 
GTGGGGACTG TGGACTTCAT CGAGGTGTGT GTTCGCGGAA CGGACAGGTG AGATGGAGAC !200 

5 5 CCCTGGGGCC GTGGGGTCTC AGGGGTGCCT GGTGAATTCT GCACTTACAC GTACTCAAGG 1 260 
GAGCGCGCCC GCGTTATCCT CGTACCTTTG TCTTCTTTCC ATCTGTGGAG TCAGTGGGTG 1320 
TCGGCCGCTC TGTTGTGGGG GAGGTGAACC AGGOAGGGGC AGGGCAAGGC AGGGCCCCCA 1380 
GAGCTGGGCC ACACAGTGGG TGCTGGGCCT CGCCCCGAAG CTTCTGGTGC AGCAGCCTCT 1440 
GGTGCTGTCT CCGCGGAAGT CAGGGCGGCT GGATTCCAGG ACAGGAGTGA ATGTAAAAAT 1500 

60 AAATATCGCT TAGAATGCAG GAGAAGGGTG GAGAGGAGGC AGGGGCCGAG GGGGTGCTTG 1560 
GTGCCAAACT GAAATTCAGT TTCTTGTGTG GGGCCTTCCG GTTCAG AGCT CTTGGCGAGG 1 620 
GTGGAGGGAG GAGTGTCATT TCTATGTGTA ATTTCTG AGC CATTGTACTG TCTGGGCTGG 16R0 
GGGGGACACT GTCCAAGGGA GTGGCCCCTA TGAGTTIATA TTTTAACCAC TGCTTCAAAT 1740 
CTCGATTtCA CTTTTTTTAT TTATCC AGTT ATATCTACAT ATCTGTCATC TAAATAAATG 1 800 

65 GGTTTCAAAC AAAAAAAAAA AAAAAAAAAA AAAAAAA 

Seq ID NO: 22 Protein sequence: 
Protein Accessions: NP 003705 

70 

1 11 21 31 41 51 
I I I I 1 1 

MCAERLGQFM TLALVLATFD PARGTDATNP PEGPQDRSSQ QKGRLSLQNT AEIQHCLVNA 60 
ODVGCG VFEC FENNSCEIRG LHGICMTFLH NAGKFDAQGK SFKDALKCK AHALRHRFGC 1 20 
75 JSRKCFAIRE MVSQLQRECY LKHDLCAAAQ ENTRVrVEMI HFKDLIXHEP YVDLVNLLLT 180 
CGEEVKEA1T HSVQVQCEQN WGSLCSILSF CTSAIQKPPT APPERQPQVD RTKXSRAHHG 240 
EAGHHLPEPS SRETGRGAKG ERGSKSKPNA HARGRVGGLG AQGPSGSSEW EDEQSEYSDI 30O 
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Seq ID NO: 23 DNA sequence 
5 Nucleic Acid Accession #: NMJMS264. 1 

Coding sequence: 557-1 954(underlined sequences correspond to start and stop codons) 

] H 21 31 41 51 

10 | | ] | I I 

GAATTCCGGC CAGAAGAAAT CTGGCCTCGG AACACGCCAT TCTCCGCGCC GC 1TCCAATA 50 
accactaaca TCCCTAACGA GCATCCGAGC CGAGGGCTCT GCTCGQAAAT CGTCCTGGCC 120 
CAACTCGGCC CTTCGAGCTC TCGAAGATTA CCGCATCTAT TTTTTTTTTC TTTnTTTCT 180 
TTTCCTAGCG CAGATAAAGT GAGCCCGGAA AGGGAAGGAG GGGGCGGGGA CACCATTGCC 240 

15 CTGAAAGAAT AAATAAGTAA ATAAACAAAC TGGCTCCTCG CCGCAGCTGG ACGCGGTCGG 300 
TTGAGTCCAG GTTGGGTCGG ACCTGAACCC CTAAAAGCGG AACCGCCTCC CGCCCTCGCC 360 
ATCCCGGAGC TGAGTCGCCG GCGGCGGTGG CTGCTGCCAG ACCCGGAGTT TCCTCTTTCA 420 
CTGGATGGAG CTGAACTTTG GGCGGCCAGA GCAGCACAGC TGTCCGGGGA TCGCTGCACG 480 
CTGAGCTCCC TCGGCAAGAC CCAGCGGCGG CTCGGGATTT TTTTGGGGGG GCGGGGACCA 540 

2 0 GCCCCGCGCC GGCACCATGT TCCTGGCGAC CCTGTACTTC GCGCTGCCGC TCTTGGACTT 600 

GCTCCTGTCG GCCGAAGTGA GCGGCGGAGA CCGCCTGGAT TGCGTGAAAG CCAGTGATCA 660 
GTGCCTGAAG GAGCAGAGCT GCAGCACCAA GTACCGCACG CTAAGGCAGT GCGTGGCGGG 720 
CAAGGAGACC AACTTCAGCC TGGCATCCGG CCTGGAGGCC AAGGATGAGT GCCGCAGCGC 780 
CATGGAGGCC CTGAAGCAGA AGTCGCTCTA CAACTGCCGC TGCAAGCGGG GTATGAAGAA 840 
25 GGAGAAGAAC TGCCTGCGCA TTTACTGGAG CATGTACCAG AGCCTGCAGG GAAATGATCT 900 
GCTGGAGGAT TCCCCATATG AACCAGTTAA CAGCAGATTG TCAGATATAT TCCGGGTGGT 960 
CCCATTCATA TCAGATGTTT TTCAGCAAGT GGAGCACATT CCCAAAGGGA ACAACTGCCT 1020 
GGATGCAGCG AAGGCCTGCA ACCTCGACC A CAT'lTGCAAG AAGTACAGGT CGGCGTACAT 1080 
CACCCCGTGC ACCACCAGCG TGTCCAACGA TGTOTGCAAC CGCCGCAAGT GCCACAAGGC 1140 

3 0 CCTCCGGCAG TTCTTTGACA AGGTCCCGGC CA AGCACAGC TAGGGAATGC TCTTCTGCTC 1200 

CTGCCGGGAC ATCGCCTGCA CAGAGCGGAG GCGACAGACC ATCGTGCCTG TGTGCTCCTA 1260 
TGAAGAGAGG GAGAAGCCCA ACTGTTTGAA TTTGCAGGAC TCCTGCAAGA CGAATTACAT 1320 
CTGCAG ATCT CGCCTTGCGG ATTTTtTTAC CAACTGCCAG CCAGAGTCAA GGTCTGTCAG 1380 
CAGCTGTCTA AAGGAAAACT ACGCTGACTG CCrCCTCGCC TACTCGGGGC TTATTGGCAC 1440 

35 AGTCATGACC CCCAACTACA TAGACTCCAG TAGCCTCAGT GTGGCCCCAT GGTGTGACTG 1500 
CAGCAACAGT GGGAACGACC TAGAAGAGTG CTTGAAATTT TTGAATTTCT TCAAGGACAA 1560 
- TACATGTCTT AAA AATGCAA TTCAAGCCTT TGGCAATGGC TCCG ATGTGA CCGTGTGGCA 1 620 
GCGAGCCTTC CCAGTACAGA CGACCACTGC CACTACCACC ACTGCCCTCC GGGTTAAGAA 1680 
CAAGCCCCTG GGGCCAGCAG GGTCTGAGAA TGAAATTCCC ACTCATGTTT TGCCACCGTG 1 740 

40 TGCAAATTTA CAGGCACAGA AGCTGAAATC CAATGTGTCG GGCAATACAC ACCTCTGTAT 1800 
TTCCAATGGT AATTATG A AA A AG AAGGTCT CGGTGCTTCC AGCCACATAA CCACAAA ATC I860 
AATGGCTGCT CCTCCAAGCT GTGGTCTGAG CCCACTGCTG GTCCTGGTGG TAACCGCTCT 1920 
GTCCACCCTA TTATCTTTAA CAGAAACATC ATAGCTGCAT TAAAAAAATA CAATATGGAC 1980 
ATGTAA AAAG ACAAA AACCA AGTT ATCTGT TTCCTGTTCT CTTGTAT A GC TGAAATI CCA 2040 

45 GT1TAGGAGC TCAGTTGAGA AACAGTTCCA TTCAACTGUA ACATHTTTT mi'CGTl "IT 2100 
AAGAAAGCTT CTTGTGATCC TTCGGGGCTTCTGTGAAAAA CCTGATGCAG TGCTCCATCC 2160 
A A ACTCAG AA GGCTTTGGG A TATGCTGTAT TTTAA AGGG A CAGTTTGTAA CTTGGGCTGT 2220 
A AAGCAAACT GGGGCTGTGT TTTCGATGAT GATG ATCATC ATGATCATGA TGATTTTAAC 2280 
AGTTTTACTT CTGGCCTTTC CTAGCTAGAG AAGGAGTTAA TATTTCTAAG GTAACTCCCA 2340 

5 0 TATCTCCTTT AATGACATTG ATTTCTAATG ATATAAATTT CAGCCTACAT TGATGCCAAG 2400 
CTTTTTTGCC ACAAAGAAGA TTCTTACCAA GAGTGGGCTT TGTGGAA ACA GCTGGTACTG 2460 
ATGTTCACCT TTATATATGT ACTAGCATTT TCCACGCTGA TGTTTATGTA CTGTAAACAG 2520 
TTCTGCACTC TTGTACAAAA GAAAAAACCA CCCOGAATTC 




MFLATLYFAL PLLDLLLSAE VSGGDRLDCV KASDQCLKEQ SCSTKYRTJ.R QCVAGKETNF 60 
SLASGLEAKDECRSAMEALKQKSLYNCRCKRCf.il- I L r SMYQS1 i^DLLEDSP 120 
YBPVNSRLSD 1FRWPFJSD VFOQVBHffiK GNNCLDAAKA CNLDDICKKY RSA Vl'JTClT 1 80 
SVSNDVCNRR KCHKALRQFF DKVPAKHSYG MLFCSCRDIA CTERRRQTIV PVCSYEEREK 240 
PNCLNLQDSC KTNYICRSRL ADFFTNCQPE SRSVSSCLKE NYADCLLAYS GLIGTVMTPN 300 
Y1DSSSLSVA PWCDCSNSGN DLEECLKFLN FFKDNTCLKN AIQAFGNCSD VTVWQFAFPV 360 
QTTTATTTTA LRVKNKFLGP AGSENEEPTH VLPPCANLQA QKLKSNVSGN THLCISNGNY 420 
EKEGLGASSH ITTKSMAAPP SCGLSPLLVL VVTALSTLLS LTETS 

Scq ID NO: 25 DNA sequence 

Nucleic Acid. Accession ft FGENESH predicted 

Coding sequence; I576(entire sequence represents open reading frame) 



CTTTTGTTTC GCCATGCCTA GTCTAGTGGT ATCTGGAATA ATGGAAAGAA ATGGGGGCTT 60 
TGGAGAACTA GGATGTTTCG GGGGAAGCGC TAAGGACCGA GGGCTGCTGG AAGACGAGCG 120 
CGCCCTTCAG CTGGCTCTCG ATCAACTCTG CCTCCTGGGT TTGGGGGAGC CCCCCGCCCC 180 
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CAGGGCGGGC GAGGACGGGG GAGGTGGGGG GGGCGGCGCC CCCGCGCAGC CGACAGCCCC 240 
CCCGCAGCCG GCGCCGCCGC CGCCGCCCGC GGCGCCCCCG GCCGCCCCGA CGACGGCCCC 300 
CGCAGCGCAG ACGCCCCAGC CCCCCACCGC CCCCAAAGGG GCGAGCGACG CCAAGCTCTG 360 
CGCTCTCTAC AAAGAGGCCG AGCTGCGCCT GAAGGGCAGC AGCAACACCA CGGAGTGTGT 420 
5 TCCCGTGCCC ACCTCCGAGC ACGTGGCCGA GATCGTGGGC AGGCAAGGCT GCAAGATTAA 480 
GGCCTTGAGG GCCAAGACCA ACACCTACAT CAAGACACCG GTGAGGGGCG AGGAACCAGT 540 
GTTCATGGTG ACAGGGCGAC GGGAGGACGT GGCCACAGCC CGGCGGGAAA TCATCTCAGC 600 
AGCGGAGCAC TTCTCCATGA TCCGTGCCTC CCGCAACAAG TCAGGCGCCG CCTTTGGTGT 660 
GGCTCCTGCT CTGCCCGGCC AGGTGACCAT CCGTGTGCGG GTGCCCTACC GCGTGGTGGG 720 

10 I' GTGGTt «. ' ' CA A AG GGGCAACCAT CAAGCGCATC CAGCAGC AAA CCAACACATA 7B0 
CATTATCACA CCAAGCCGTG ACCGCGACC CGTG'i CGAG ATCACGGGTG ( CCAGGC A t* 
CGTGt f GCGCGCGAG 1 * .a( GCACATCGCG GTGCGCAC'l'G GCAAGATCCT 900 

CGAGTACAAC AATGAAAACG ACTTCCTGGC GGGGAGCCCC GACGCAGCAA TCGATAGCCG 960 
CTACTCCGAC GCCTGGCGGG TGCACCAGCC CGGCTGCAAG CCCCTCTCCA CCTTCCGGCA i020 

1 5 GAACAGCCTG GGCTGCATCG GCGAGTGCGG AGTGGACTCT GGCTTTGAGG CCCCACGCCT 1 030 
GGGTGAGCAG GGCGGGGACT TTGGCTACGG CGGGTACCTC TTTCCGGGCT ATGGCGTGGG 1140 
CAAGCAGGAT GTGTACTACG GCGTGGCCGA GACTAGCCCC CCGCTGTGGG CGGGCCAGGA 1200 
GAACGCCACG CCCACCTCCG TGCTCTTCTC CTCTGCCTCC TCCTCCTCCT CCTCTTCCGC 1260 
CAAGGCCCGC GCTGGGCCCC CGGGCGCACA CCGCTCCCCT GCCACTTCCG CGGGACCCGA .320 

20 GCTGGCCGGA CTCCCGAGGC GCCCCCCGGG AGAGCCGCTC CAGGGCTTCT CTAAACTTGG 13S0 
TGGGGGCGGC CTGCGGAGCC CCGGCGGCGG GCGGGATTGC ATGGTCTGCT TTGAGAGCGA 1440 
AGTGACTGCC GCCCTTOTGC CCTGCGGACA CAACCTGTTC TGCATGGAGT GTGCAGTACG 1500 
CATCTGCGAG AGGACGGACC CAGAGTGTCC CGTCTGCCAC ATCACAGCCA CGCAAGCCAT 1560 

^ CCGAATATTC TCCTAA 

Seq ID NO: 26 Protein seouence: 

Protein Accession H: FGIiNBSH predicted 



30 1 11 21 31 41 51 

FCFAMPSLVV SGMERNGGF GELGCFGGSA KDRGLLEDER ALQLALDQLC LLGLGEPPAP 60 
RAGEDGGGGG GGAPAQPTAP PQF AFFPPPA APPAAPTTAP AAQTPQPPTA PStGASDAKLC 1 20 
ALYKEAELRL KGSSNTTECV PVPTSEHVAE 1VGRQGCKISC ALRAKTNTYI KTPVRGEEPV 180 

35 FMVTGRREDV ATARREIISA AEHFSM1RAS RNKSGAAFGV APALPGQVT1 RVRVPYRWG 240 
LVVGPfCGATI KRIQQQTNTY IITPSRDRDP VFEITGAPGN VERAREESET HIAVRTGKIL 300 
EYNNENDFLA GSPDAAIDSR YSDAWRVHQP GCKPLSTPRQ NSLGC1GECG VDSGFEAPRL 360 
GEQGGDFG YG GYLFPG YGVG KQDVYYGVAE TSPPLWAGQE NATPTSVLFS SASSSSSSSA 420 
KARAGPPGAH RSPATSAGPE LAGLPRRPPG EPLQGFSKLG GGGLRSPGGG RDCMVCFESE 4S0 

40 VTA ALVPCGH NLFCMECAVR 1CERTDPECP VCHITATQ AI RIFS 

Seq ID NO: 27 DNA sequence 

Nucleic Acid Accession ft FGENESH predicted 

Coding sequence: 1 -2070 (underlined sequences correspond to start and stop endons) 

45 

1 11 21 31 41 51 

ATGAGCGGTG CGGGGGTGGC GGCTGGGACG CGGCCCCCCA GCTCGCCGAC CCCGGGCTCT 60 
CGGCGCCGGC GCCAGCGCCC CTCTGTGGGC GTCCAGTCCT TGAGGCCGCA GAGCCCGCAG 120 
50 CTCAGGCAGA GCGACCCGCA GAAACGGAAC CTGGACCTGG AGAAAAGCCT GCAGTTCCTG 180 
CAGCAGCAGC ACTCGGAGAT GCTGGCCAAG CTCCATGAGG AGATCGAGCA TCTGAAGCfiG 2« 
GAAAACAAGG GTGAGCCGGC GCGGGGGCCT AGGCCGGCCC TGCCTCCCCA GGCACACTCA 300 
ACACTGCCGC TCCCGCAGCA CAGAAACACA GCCATCAACT CCAGCACACG CCTGGGCTCA 360 
GGGGGAACAC AGGACGGGGA GCCCCTCCAG ACTGTCCTTG CCCACCTGGC TGCACTGGCC 420 

5 5 CCTGTATGCC AACCCAGTGG GTACAGGTTC TGGGGG ACCT GG ACAGATGC CGCTACCTCT 480 

AGCCGTGGCT GGACGATGTT ATGCAGCCAA GCACAGCACG TGCTGCTCTC GGGAAGCCCA 540 
GGGCCTGAGG TCATTGCAGG GCGGCAGGTG GCCACAGGGT GCTCCCCAGA CCTCCCTCCT 600 
CCAAGTAGAG CTGAAATGGG AAGGAACCCC TGGGACAGCC CCTGCCCTGC TAGATCTTTG 660 
CCTCAGATTG CTGCTGTGGC CAGGCCCAGG ATTTCCAGCC CTATGGCTCT GAGTCCTCAC 720 

6 0 ATGCTGGGGG CCCAGGGG AT ATGGACACAC TCCATCCAGG GATCCCTTCC TGCC ATCTGG 780 

3i ICiCAAC) I ij^tceLr \C \AA(J(jGAGGAAGC AGAGTCCTGTTTCCTTGCCA CTTi L'CCAAG mo 
GCAC'iTCCCC ATCCTGACAG CGGCCCCCAC CCAGCCCAGG ATCTTr GGCT GTGGTCTC 00 
GCTC ACTTCC CATTATCTTT GGGGCTGGGG CTG ACATCAG GAGGACATCT GACTGGTGGA 960 
TGGAGCCAGC CTGGGAACAT CGCAGCTGGG GCAGTGCCTA GGGCTCTCCC TTCCCAGGGA 1020 

65 GACATGGAGA AGGGGGTTGA GGGAGGGCCC TTCCCTAGCC GGTGTGGCAA CTCCAGTGAG 1 0B0 
CTGTTCTGGG CAAAGTGTGG CCCAAGTCGG CAGCCCCAGC CCTGCAGTGC TGGGGACGCT 1 140 
GACAGGACAC GGGAAGAGGC CATGCTTTCC CTCGGGACCT GCTGTTCCAT GTGTCCCAAG 1200 
CCCTCCTG CT TTCCAG ATGG CCCCTCAGG A AACCACCTTT CCAGGGCCTC TGCTCCCTTG 1 260 
GGCGCTCGCT GGGTCTGCAT CA ACGG AGTG TGGGTAGAGC CGGGAGGACC CAGCCCTGCC 1320 

70 AGGCTGAAGG AGGGCTCCTC ACGGACACAC AGGCCAGGAG GCAAGCGTGG GCGTCTTGCG 1380 
GGCGGTAGCG CCGACACTGT GCGCTCTCCT GCAGACAGCC TCTCCATGTC AAGCTTCCAG 1440 
TCTGTCAAGT CCATCTCTAA TTCAGCCAAC TCTCAAGGCA AGGCCAGGCC CCAGCCCQGC 1500 
TCCTTCAACA AGCAAGATTC AAAAGCTGAC GTCTCCCAGA AGGCGGACCT GGAAGAGGAG S560 
CCCCTACTTC ACAACAGCAA GCTGGACAAA GTTCCTGGGG TAGAAGGGCA GGCCAGAAAG 1620 

75 GAGAAAGCAG AGGCCTCTAA TGCAGGAGCT GCCTGTATGG GGAACAGCCA GCACCAGGGC 1 680 
AGGCAGATGG GGGCGGGGGC ACACCCCCCA ATGATCCTGC CCCTTCCCCT GCGAAAGCCC 1740 
ACCACACTTA GGCAGTGCG A AGTGCTCATC CGCGAGCTGT GG AATACCAA CCTCCTGCAG 1 800 
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ACCCAAGAGC TGCGGCACCT CAAGTCCCTC CTGGAAGGGA GCCAGAGGCC CCAGGCAGCC 1 860 
CCGGAGGAAG CTAGCTTTCC CAGGGACCAA GAAGCCACGC ATTTCCCCAA GGTCTCCACC 1 520 
AAGAGCCTCT CCAAGAAATG CCTGAGCCCA CCTGTGGCGG AGCGTGCCAT CCTGCCCGCA 1980 
CTGAAGCAGA CCCCGAAGAA CAACTTTGCC GAGAGGCAGA AGAGGCTGCA GGCAATGCAG 2040 
AAACGGCGCC TGCATCGCTC AGTGCTTTGA 

Seq ID NO: 28 



MSG AG V AAGT RPPSSPTPGS RRRRQRPSVG VQSLRPQSPQ LRQSDPQKRN LDLEK.SLQFL 60 
QQQHSEMLAK LHEEIEHLKR ENKGEPARGP RPALPPQAHS TLPLPQHRNT AINSSTRLGS 1 20 
1 5 GGTQDGEPLQ TVLAHLAALA PVCQPSGYRF WGTWTDAATS SRGWTMLCSQ AQHVLLSGSP J 80 
GPEVIAGRQV ATGCSPDLPP PSRAEMGRNP WDSPCPARSL FQIA AVARPR ISSPMALSPH 240 
MLGAQGEWTH S1QGSLPA1W AATMGTKGGS RVLFPCHLSK. ALPHPDSGPH PAQDPGLWSQ 300 
AHFPLSLGLO LTSOGHLTGG WSQPGNIAAG AVPRALP5QG DMEKGVEGGP FPSRCGNSSE 360 
LFWAKCGPSR QPQPCSAGDA DRTREEAMLS LGTCCSMCPK. PSCFPDGPSG NHLSRASAPL 420 

2 0 GARWVCINGV WVEPGGPSPA RLKEGSSRTH RPGGKRGRLA GGSADTVRSP ADSLSMSSFQ 450 

SVKSISNSAN SQGKARPQPG SFNKQDSKAD VSQKADLEEE PLLHNSKLDK VPG VQGQARK 540 
EKAEASNAGA ACMGNSQHQG RQMQAGAHPP M1LPLPLRKP TTLRQCEVLI RELWNTNLLQ 600 
TQELRHLKSL LEGSQRPQAA PEEASFPRDQ EATHFPKVST KSLSKKCLSP PVAERAJLPA 660 
^ ^ LKQTPKNNFA ERQKRLQAMQ KRRLHRSVL 

Seq ID NO: 29 DNA sequence 
Nucleic Acid Accession #: 
Coding sequence: 136-24 

30 

1 11 21 31 41 51 

CTCGTGCCGA ATTCGGCACG AQACCGCGTG TTCGCGCCTG GTAQAGATTT CTCGAAGACA 60 
CCAGTGGGCC CGTGTGOAAC CAAACCTGCG CGCGTGGCCG GGCCGTGGGA CAACOAQGCC 120 

3 5 GCGGAG ACG A AGGCGCAATG GCGAGG AAGT TATCTGTAAT CTTG ATCCTG ACCTTTGCCC 180 

TCTCTGTCAC AAATCCCCTT CATGAACTAA AAGCAGCTGC TTTCCCCCAG ACCACTGAGA 240 
AAATTAGTCC GAATTGGGAA TCTGGCATTA ATGTTGACTT GGCAATTTCC ACACGGCAAT 300 
ATCATCTACA ACAGCTTTTC TACCGCTATG GAGAAAATAA TTCTTTGTCA GTTGAAGGGT 360 
TCAGAAAATT ACTTCAAAAT ATAGGCATAG ATAAGATTA A AAGAATCCAT ATACACCATG 420 

40 ACCACGACCA TCAGTCAGAC CACGAGCATC ACTCAGACCA TGAGCGTCAC TCAGACCATG 480 
AGCATCACTC AGACCACGAG CATCACTCTG ACCATGATCA TCACTCTCAC CATAATCATG 540 
CTGCTTCTGG TAAAAATAAG CGAAAAGCTC TTTGCCCAGA CCATGACTCA GATAGTTCAG 600 
GTAAAGATCC TAGAAACAGC CAGGGGAAAG GAGCTCACCG ACCAGAACAT GCCAGTGGTA 660 
GAAGGAATGT CAAGGACAGT GTTAGTGCTA GTGAAGTGAC CTCAACTGTG TACAACACTG 720 

45 TCTCTGAAGG AACTCACTTY CTAGAGACAA TAGAGACTCC AAGACCTGGA AAACTCTTCC 780 
CCAAAGATGT AAGCAGCTCC ACTCC ACCC A GTGTCACATC AAAGAGCCGG GTG AGCCGGC 840 
TGGCTGGTAG GAAAACAAAT GAATCTGTGA GTGAGCCCCG AAAAGGCTTT ATGTATTCCA 900 
GAAACACAAA TGAAAATCCT CAGGAGTGTT TCAATGCATC AAAGCTACTG ACATCTCATG 960 
QCATGGGCAT CCAGGTTCCG CTGAATGCAA CAG AGTTCAA CTATCTCTGT CCAGCCATCA 1 020 

50 TCAACCAAAT TGATGCTAGA TCTTGTCTG A TrCATACAAG TGAAAAGAAG GCTGAAATCC 1080 
CTCCA AAGAC CTATTCATTA C AA ATAGCCT GGGTTGGTGG TTTTATAGCC ATTTCCATCA 1 1 40 
TCAGTTTCCT GTCTCTGCTG GGGGTTATCT TAGTGCCTCT CATGAATCGG GTGTTTTTCA 1200 
AATTTCTCCT GAGTTTCCTT GTGGCACTGG CCGTTGGGAC TTTGAGTGGT GATGCTTTTT 1260 
TACACCTTCT TCCACATTCT CATGCA AGTC ACCACCATAG TCATAGCCAT GAAGAACCAG 1 320 

55 CAATGGAAAT GAAAAGAGGA CCACTTTTCA GTCATCTGTC TTCTCAAAAC ATAGAAGAAA 13E0 
GTGCCTATTT TGATTCCACG TGGAAGGGTC TAACAGCTCT AGGAGGCCTG TATTTCATQT 1440 
TTCTTGTTGA ACATGTCCTC ACATTGATCA AACAATTTAA AGATAAGAAG AAAAAGAATC 1500 
AGAAG AAACC TGAAAATG AT GATGATGTGG AGATTAAGAA GCAGTTGTCC AAGTATGAAT 156D 
CTCAACTTTC AACAAATGAG GAGAAAGTAG ATACAGATGA TCGAACTGAA GGCTATTTAC 1620 

60 G AGCAGACTC ACAAG AGCCC TCCCACTTTG ATTCTCAGCA GCCTGCAGTC TTGG AAGA AG 1680 
AAGAGGTCAT GATAGCTCAT GCTCATCCAC AGG A AUTCTA CAATGAATAT GTACCCAQAG 1740 
GGTGCAAGAA TAAATGCCAT TCACATTTCC ACGATACACT CGGCCAGTCA GACGATCTCA 1 ROD 
TTCACCACCA TCATGACTAC CATCATATTC TCCATC ATCA CCACCACCAA A ACCACCATC 1 860 
CTCACAGTCA CAGCCAGCGC TACTCTCGGG AGGAGCTGAA AGATGCCGGC GTCGCCACTT 1920 

65 TGGCCTGGAT GGTGATAATG GGTGATGGCC TGCACAATTT CAGCGATGGC CTAGCAATTG 1980 
GTGCTGCTTT TACTG AAGGC TTATCAAGTG GTTTAAGTAC TTCTGTTGCT GTGTTCTGTC 2040 
ATGAGTTGCC TCATGAATTA GGTGACTTTG CTGTTCTACT AAAGGCTGGC ATGACCGTTA 2100 
AGCAGGCTGT CCTTTATAAT GCATTGTCAG CCATGCTGGC GTATCTTGGA ATGGCAACAG 2160 
GAATTTTCAT TGQTCATTAT GCTGAAAATG TTTCTATGTG GATATTTGCA CTTACTGCTG 2220 

70 GCTTATTCAT GTATGTTGCT CTGGTTGATA TGGTACCTGA AATGCTGCAC AATGATGCTA 2280 
GTGACCATGG ATGTAGCCGC TGGGGGTATT TCTTTTTACA GAATGCTGGG ATGCTTTTGG 2340 
GTTTTGGAAT TATGTTACTT ATTTCCATAT TTGAACATAA AATCGTGTTT CGTATAAATT 2400 
TCTAGTTAAG GTTTAAATGC TAGAGTAGCT TAA AA AGTTG TCATAGTTTC AGTAGGTCAT- 2460 
AGGGAGATGA GTTTGTATGC TGTACTATGC AGCGTTTAAA GTTAGTGGGT TTTGTGATTT 2520 

7 5 rTGTATTGAA TATTGCTGTC TGTTACAAAG TCAGTTAAAG GTACGTTTTA ATATTTAAGT 2580 
TATTCTATCT TGG AGATAAA ATCTGTATGT GCAATTCACC GGTATTACCA GTTTATTATG 2640 
TAAACAAGAG ATTTGGCATG ACATGTTCTG TATGTTTCAG GGAAAAATGT CTTTAATGCT 2700 
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TTTTCAAGAA CTAACACAGT TATTCCTATA CTGGATTTTA GGTCTCTGAA GAACTGCTGG 2760 
TGTTTAGGAA TAAGAATGTG CATGAAGCCT AAAATACCAA GAAAGCTTAT ACTGAATTTA 2820 
AGCAAAGAAA TAAAGGAGAA AAGAGAAQ A A TCTGAGAATT GGGGAGGCAT AGATTCTTAT 2880 
AAAAATCACA AAATTTGTTG TAAATTAGAG GGGAGAAATT TAGAATTAAG TATAAAAAGG 2940 
5 CAGAATTAGT ATAGAGTACA TTCATTAAAC ATTTTTGTCA GGATTATTTC CCGTAAAAAC 3000 
GTAGTGAGCA CTCTCATATA CTAATTAGTG TACATTTAAC TTTGTATAAT ACAGAAATCT 3060 
AAATATATTT AATGAATTCA AGCAATATAC ACTTGACCAA GAAATTGGAA TTTCAAAATG 3120 
TTCGTGCGGG TTATATACCA GATGAGTACA GTGAGTAGTT TATGTATCAC CAGACTGGGT 3180 
TATTGCCAAG TTATATATCA CCAAAAGCTG TATGACTGGA TGTTCTGGTT ACCTGGTTTA 3240 
1 0 CAAAATTATC AGAGTAGTAA AACTTTGATA TATATGAGGA TATTAAAACT ACACTAAGTA 3300 
TCATTTGATT CG ATTCAGAA AGTACTTTGA TATCTCTCAG TGCTTCAGTG CTATCAITGT 3360 
GAGCAATTGT C'lTTATATAC GGTACTGTAG CCATACTAGG CCTGTCTGTG GCATTCTCTA 3420 
GATGTTTCTT TTTTACACAA TAAATTCCTT ATATCAGCTT G 

15 Sea ID NO: 30 Protein sequence: 

Protein Accession ft 1 : NP„03 645 1 .2 

1 II 21 31 41 51 

20 MARKLSVILI LTFALSVTNP LHELKAAAFP QTTEKISPNW ESGfflVDLAI STRQYHLQQL 60 

FYRYGENNSL SVEGFRKLLQ NIGIDKIKRI HIHHDKDHHS DHEHHSDHER HSDHEHHSDH 120 
EHHSDHDHHS HHNHAASGKN KRKALCPDHD SDSSGKDPRN SQGKGAHRPE HASGRRNVKD 1 80 
SVSASEVT5T VYNTVSEGTH FLETIETPRP GKLFPKDVSS STPPSVTSKS RVSRLAGRKT 240 
NESVSEPRKG FMYSRNTNEN PQECFNASKL LTSHGMGIQV PLNATEFNYL CPAIINQIDA 300 

25 RSCLIHTSEK KAEJPPKTYS LQIAWVGGFf. A1SIISFLSL LGVILVPLMN RVFFKFLLSF 360 

LVALAVGTLS GDAFLHLLPH SHASHHHSHS HEEPAMEMKR GPT.FSHLSSQ MEESAYFDS 420 
TWKGLTAUiG LVFMFLVEHV LTLKQFKDK ECKKNQKKPfN DDD L t v I a tSQLSTN 480 
EF.K.VDTDDR T EGYLRADSQE PSHFDSQQPA VI JiKEE VMi A HAHPQEVYNE YVPRGCKNKC 540 
HS11FHDTLGQ SDDLIHHHHD YHHILHlllLh' QNH111 1 . HSQ KYSREELKDA GVATLAWMVI 600 

3 0 MGDGLHNFSD GLAIGAAFTE GLSSGLSTSV AVFCHELPHE LGDFAVLLKA GMTVKQAVLY 660 
NALSAMLAYL GMATGIFIGH YAENVSMWIF ALTAGLFMYV ALVDMVPEML HNDASDHGCS 720 
RWGYFFLQNA GMLLGFGIML L1SIFEHK1V FPJNF 

Seq ID NO: 31 DNA sequence 
35 Nucleic Acid Accession*: NM 002184.] 

Coding sequence: 256-301 2( underlined sequences correspond to start and stop codDns) 

1 11 21 31 41 51 

40 G AGCAGCCAA AAGGCCCGCG GAGTCGCGCT GGGCCGCCCC GGCGCAGCTG AACCGGGGGC 60 
CGCGCCTGCC AGGCCGACGG GTCTGGCCCA GCCTGGCGCC AAGGGGTTCG TGCGCTGTGG 120 
AG ACGCGGAG GGTCGAGGCG GCGCGGCCTG AGTG AAACCC AATGGAAA AA GCATGACATT 1 80 
TAGAAGTAGA AGACTTAGCT TCAAATCCCT ACTCCTTCAC TTACfAATl T TGTGATTTGG 240 
AAATATCCGC GCAAGATGTT GACGTTGCAG ACTTGMiTAG TGCAAGCCTT GTrTATTTTC 300 
45 CICACCACTG AATCTACAGG TGAAOTCTA GATCCATGTG GTTATATCAG TCCTGAATCT 360 
CCAGTTGTAC AACTTCATTC TAATTTCACT GCAGTrTGTG TGCTAAAGGA AAAATGTATG 420 
GATTATTTTC ATGTAAATGC TAATTACATT GTCTGGAAAA CAAACCATTT TACTATTCCT 480 
AAGGAGCAAT ATACTATCAT AAACAGAACA GCATCCAGTG TCACCTTTAC AGATATAGCT 540 
TCATTAAATA TTCAGCTCAC TTGCAACATT CTTACATTCG GACAGCTTGA ACAGAATGTT 600 
5 0 TATGGAATCA CAATAATTTC AGGCTTGCCT CCAGAAAAAC CTAAAAATTT GAGTTGCATT 660 

GTGAACGAGG GGAAGAAAAT GAGGTGTG AG TGGGATGGTG GAAGGGAAAC ACACTTGGAG 720 
ACAAACTTCA CTTTAAAATC TGAATGGGCA ACACACAAGT TTGCTGATTG CAAAGCAAAA 780 
CGTGACACCC CCACCTCATG CACTGTTGAT TATTCTACTG TGTATTTTGT CAACATTGAA B40 
GTCTGGGTAG AAGCAGAGAA TGCCCTTGGG AAGGTTACAT CAGATCATAT CAATTTTGAT 900 
5 5 CCTGTATATA AAGTGAAGCC CAATCCGCCA CATAATTTAT CAGTGATCAA CTCAGAGG AA 960 
CTGTCTAGTA TCTTAAAATT G ACATGG ACC AACCCAAGTA TTAAGAGTGT TATAATACTA 1 020 
AAATATAACA TTCAATATAG GACCAA AGAT GCCTCA ACTT GG AGCCAGAT TCCTCCTGAA 1080 
GACACAGCAT CCACCCGATC TTCATTCACT GTCCAAGACC TTAAACCTTT TACAGAATAT 1 140 
GTGTTTAGGA TTCGCTGTAT GAAGGAAGAT GGTAAGGGAT ACTGGAGTGA CTGGAGTGAA 1200 
60 GAAGCAAGTG GGATCACCTA TGAAG ATAG A CCATCTA AAG CACCAAGTTT CTGGTATAAA ] 260 
ATAGATCCAT OCCATACTCA AGGCTACAGA ACTGTACAAC TCGTGTGGAA G ACATTGCCT 1320 
CCTTTTGAAG CCA ATGGAAA AATCTTGGAT TATGAAGTGA CTCTCACAAG ATGGAAATCA 1 380 
CATTTACAAA ATTACACAGT TAATGCCACA AAACTGACAG TAAATCTCAC AAATGATCGC 1440 
TATCTAGCAA CCCTAACAGT AAGAAATCTT GTTGGCAAAT CAGATGCAGC TGTTTTAACT 1500 
65 ATCCCTGCCT GTGACTTTCA AOCTACTCAC CCTGTAATGG ATCTTAAAGC ATTCCCCAAA 1560 
GATAACATGC TTTGGGTGGA ATGGACTACT CCAAGGGAAT CTGTAAAGAA ATATATACTT 1620 
GAGTGGTGTG TGTTATCAGA TAAAGCACCC TGTATCACAG ACTGGCAACA AGAAGATGGT 1 680 
ACCGTGCATC GCACCTATTT AAGAGGGA AC TTAGCAG AG A GCAAATGCTA TTTGATAACA 1 740 
GTTACTCCAG TATATGCTGA TGGACCAGG A AGCCCTGAAT CCATAAAGGC ATACCTTA AA 1 800 
70 CAAGCTCCAC CTTCCAAAGG ACCTACTGTT CGGACAAAAA AAGTAGGGAA AAACGAAGCT 1860 
GTCTTAGAGT GGGACCAACT TCCTGTTGAT GTTCAGAATG GATTTATCAG AAATTATACT !920 
ATATTTTATA GAACCATCAT TGG AAATGAA ACTGCTGTG A ATGTGGATTC TTCCCACACA 1 9S0 
GAATATACAT TGTCCTCTTT GACTAGTGAC ACATTGTACA TGGTACGAAT GGCAGCATAC 2040 
ACAGATGAAG GTGGGAAGGA TGGTCCAGAA TTCACTTTTA CTACCCCAAA GTTTGCTCAA 21 00 



75 GGAGAAATTG AAGCCATAGT CGTGCCTGTT TGCTTAGCAT TCCTATTGAC AACTCTTCTG 2 1 60 
GGAGTGCTGT TCTGCTTTAA TAAGCGAGAC CTAATTAAAA AACACATCTG GCCTAATGTT 2220 
CCAGATCCTT CAAAGAGTCA TATTGCCCAG TGGTCACCTC ACACTCCTCC AAGGCACAAT 22S0 
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TTTAATTCAA AAGATCAAAT GTATTCAGAT GOCAATTTCA CTGATGTAAG TGTTGTGGAA 2340 
ATAGAAGCAA ATGACAAAAA GCCTTTTCCA GAAGATCTGA AATCATTGGA CCTGTTCAAA 2400 
AAGGAAAAAA TTAATACTGA AGGACACAGC AGTGGTATTG GGGGGTCTTC ATGCATGTCA 2460 
TCTTCTAGGC CAAGCATTTC TAGCAGTGAT GAAAATGAAT CTTCACAAAA CACTTCGAGC 2520 
ACTGTCCAGT ATTCrACCGT GGTACACAGT GGCTACAGAC ACCAAGTTCC GTCAGTCCAA 25S0 
GTCTTCTCAA GATCCGAGTC TACCCAGCCC TTGTTAGATT CAGAGGAGCG GCCAGAAGAT 2640 
CTACAATTAG TAGATCATGT AGATGGCOGT GATGGTATTT TGCCCAGGCA ACAGTACTTC 2700 
AAACAGAACT GCAGTCAGCA TGAATCCAGT CCAGATATTT CACATTTTGA AAGGTCAAAG 2760 
CAAGTTTCAT CAGTCAATGA GGAAGATTTT GTTAGACTTA AACACCAGAT TTCAGATCAT 2S20 
ATTTCACAAT CCTGTGGATC TGGGCAAATG AAAATGTTTC AGGAAGTTTC TGCAGCAGAT 2S80 
GCTTTTGGTC CAGGTACTGA GGGACAAGTA CiAAAGATTTG AAACAGTTGG CATGGAGGCT 2940 
GCGACTGATCi AAGGCATGCC TAAAAGTTAC TTACCACAGA CTGTACGGCA AGGCGGCTAC 3000 
ATGCCTCAGTGAAGGACTAG TAGTTCCTGC TACAACTTCA GCAGTACCTA TAAAGTAAAG 3060 
CTA AAATGAT TTTATCTGTG AATTC 



1 11 21 31 41 51 

MLTLQTWVVQ ALFIFLTTES TGELLDFCGY ISPESPVVQL HSNFTAVCVL KEKCMDYFHV 60 
NANY1VWKTN HFTIPKEQYT IINRTASSVT FTDIASLNIQ LTCN1LTFGQ LEQNVYG1TI 120 
iSGLPPEKPK NLSC1VNEGK KMRCEWDGGR ETHLETNFTL KSEWATHKFA DCKAKRDTPT 1 80 
SCTVDYSTVY FVNIEVWVEA ENALGK VTSD HJNFDPVYKV KPNPPHNLSV JNSEELSSIL 240 
KLTWTNPSIK SVIILKYNIQ YRTKDASTWS QIPPEDTAST RSSFTVQDLK PFTEYWRIR 300 
CMKEDGKGYW SDWSEEASGl TYEDRPSKAP SFWYKIDPSH TQGYRT VQLV WKTLPPFEAN 360 
G10LDYEVTI, TR WKSW.QNY T VNATKLTVN LTODRYIJVTL TVRNLVGKSD AAVLTIPACD 420 
FQATHPVMDL KAFPKDNMLW VKWTTPRESV KKYILEWCVL BDKAPC1TDW QQEDGTVHRT 480 
, 1 RGNLA GSI Cs 1 ITVTJ'V"i AlXii GEPJ W KAYJ Kf}/ 1" S Ki JP1 VftTKKV GKNBAV • EV/D 540 
QLPVDVQNGF 1KNYTIFYKT 1IGN15TAVNV DSSHTEYTLS SLTSDTLYMV RMAAYTDEGG 600 
KDGPEFTFTT PKFAQGEIEA IWPVCLAFL LTTLLGVLFC FNKRDL1KKH IWPNVPDPSK 660 
SHIAQWSPHT PPRHNFNSKD QMYSDGNFTD VSVVEIEAND KKPFPEDLKS LDLFKKEKIN 720 
TEGHSSGIGG SSCMSSSRPS ISSSDENESS QNTSSTVQYS TVVHSGYRHQ VPSVQVFSRS 780 
ESTQPLLDSE ERPEDLQLVD HVDGGDGILP RQQYFKQNCS QHESSPD1SH PERSKQVSSV 840 
NEEDFVRLKQ QISDH1SQSC GSGQMKMFQE VSAADAFGPG TEGQVERFET VGMEAATDEG 900 
MPKSYLPQTV RQGGYMPQ 

Seq ID NO: 33 DNA sequence 

Nucleic Acid Accession * NMOI 8255. 1 

Coding sequence; ! 1-2491 (imderiined sequences correspond to start and stop cDdons) 

1 11 21 31 41 51 
111)!) 

AGTTGGCG AC ATGGTGGCAC CCGTGCTGGA G ACTTCTCAC GTGTTTTGCT GCCCA AACCG 60 
GGTGCGGGGA GTCCTGAACT GGAGCTCTGG GCCCAGAGGA CTTCTGGCCT TTGGCACGTC 120 
CTGCTCCGTG GTGCTCTATG ACCCCCTGA A AAQGGTTGTT GTTACCAACT/ TGAATGGTCA 1 80 
CACCGCCCGA GTCAATTGCA TACAGTGGAT TTGTAAACAG GATGGCTCCC CTTCTACTGA 240 
ATTAGTTTCT GGAGGATCTG ATAATCAAGT GATTCACTGG GAAATAGAGG ATAATCAGCT 30O 
TTTAAAAGCA GTGCATCTTC AAGGCCATGA AGGACCTGTT TATGCGGTGC ATGCTGTTTA 360 
CCAGAGGAGG ACATCAGATC CTGCATTATG TACACTGATC GTTTCTGCAG CTGCAGATTC 420 
TGCTOTTCGA CTCTGGTCTA AAAAGGGTCC AGAAGTAATG TGCCTTCAGA CTTTAAACTT 4SO 
TGGAAATGGA TTTGCTTTGG CTCTCTGCTT ATCTTTTTTG CCAAATACTG ATGTACCAAT 540 
ATTAGCATGT GGCAATG ATG ATTGCAGAAT TCACATATTT GCTCAACAAA ATGATCAGTT 600 
TCAGA AAGTG CTTTCTCTCT GTGGACATG A GGATTGG ATT AG AGGAGTGG AATGGGCAGC 660 
CTTTGGTAGA GATCTTTTCC TAGCAAGCTG TTCACAAGAT TGCCTGATAA GAATATGGAA 720 
GCTGTATATA AAGTCAACAT CTTTAGAAAC TCAGGATGAC GATAACATAA GACTGAAAGA 780 
AAATACTTTT ACCATAGAAA ATGAAAGTGT TAAAATAGCA TTTGCTGTTA CTCTGGAGAC 840 
AGTGCTAGCC GGTCATGAAA ACTGGGTAAA TGCAGTTCAC TGGCAACCTG TGTTTTACAA 900 
AGATGGTGTC CTACAGCAGC CAGTGAGATT ATTATCTGCT TCCATGGATA AAACCATGAT 960 
TCTCTGGGCT CCAGATGAAG AGTCAGGAGT TTGGCTAGAA CAGGTTCGAG TAGGTGAAGT 1020 
AGGTGGGAAT ACTTTGGGAT T'iTA'l'CiATTG CCAGTTCAAT GAAGATGGCT CCATGATCAT iORO 
TGCTCATGCT Tl'CCACGGAG CGTTGCACCT TTGGAAACAG AATACAG'ITA ACCCAAGAGA 1 140 
GTGGACTCCA GAGATTGTCA TTTCAGGACA CTTTGATGGT GTCCAAGACC TAGTCTGGGA 1200 
TCCAGAAGGA GAATTTATTA TCACTGTTGG TACTGATCAG ACAACTAGAC TTTTTGCTCC 1260 
ATGGAAGAGA AAAGACCAAT CACAGGTGAC TTGGCATGAA ATTGCAAGGC CTCAGATACA 1320 
TGGGTATGAC CTGAAATGTT TGGCAATGAT TAATCGGTTT CAGTTTGTAT CTGGAGCAGA 1380 
TGAAAAAGTT CTTCGGGTTT TTTCTGCACC TCGGAATTTT GTGGAAAATT TTTGTGCCAT 1440 
TACAGGACAA TCACTG AATC ATGTGCTCTG TAATCAAG AT AGTGATCTTC CAGAAGG AGC 1 500 
CACTGTCCCT GCATTGGGAT TATCAAATAA AGCTGTCTTT CAGGGAGATA TAGCTTCTCA 1560 
GCCTFCTG AT G A AG AGGAGC TGTTAACTAG TACTGGTTTT GAGTATCAGC AGGTGGCCTT 1 620 
TCAGCCCTCC ATACTTACTG AGCCTCCCAC TGAGGATCAT CTTCTGCAGA ATACTTTGTG S680 
GCCTGA AGTT CAAAA ACTAT ATGGGCACGG TT ATGAA ATA TTTTGTGTTA CTTGTAACAG 1 740 
TTCAAAGACT CTGCTTGCCT CAGCTTGTAA GGCAGCTAAG AAAGAGCATG CAGCTATCAT 1800 
TCTTTGGAAC ACTACATCTT GGAAACAGGT GCAGAATTTA GTTTTCCACA GTTTGACAGT 1860 
CACGCAGATG GCCTTCTCAC CTA ATGAGAA GTTCTTACTA GCTGTTTCCA GAGATCGAAC 1920 
CTGGTCATTG TGGAAAAAGC AGGATACAAT CTCACCTGAG TTCGAGCCAG TTTTTAGTCT 19B0 
TTTTGCCTTC ACCAACAAAA TTACTTCTGT GCACAGTAGA ATTATTTGGT CTTGTGATTG 2040 
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GAGTCCTGAC AGCAAGTATT TCTTCACTGG GAGTCGAGAC AAAAAGGTGG TTGTCTGGGG 2100 
TG AGTGCGAC TCCACTGATO ACTGTATTG A GCACAACATT GGCCCCTGCT CCTCAGTCCT 2 1 50 
GGACGTGGGT GGGGCTGTGA CAGCTGTCAG CGTCTGCCCA GTGCTCCACC CTTCTCAACG 2220 
ATACGTGGTT GCAGTAGG AT TGGAGTGTGG AAAGATTTGC TTATATACCT GGAAAAAGAC 2280 
TGATCAAGTT CCAGAAATAA ATG ACTGG AC CCACTGTGTA GAA ACAAGTC AAAGCCAAAG 2340 
TCATACACTG GCTATCAGAA AATTATGCTG GAAGAATTGC AGTGGAAAAA CTGAACAGAA 2400 
GG AAGCAGAA GGTGCTG AGT GGTTACACTT TGCAAGCTGT GGTGAAGATC ACACTGTGAA 2460 
GATACACAGA GTCAATAAAT QTGCACTGTAATGG 

Seq ID NO: 34 Protein sequence; 
Protein Accession*! NP_06072S.l 

1 11 21 31 41 51 

I I 1 I I I 

MVAPVLETSH VFCCPNRVRG VLNWSSGPRG LLAFGTSCSV VLYDPIXRVV VTNLNGHTAR 60 
VNCIQWICKQ DGSPSTELVS GGSDNQV1HW E1EDNQLLKA VHLQGHEGPV YAVHAVYQRR 120 
TSDPALCTU VSA AADSAVR LWSKKGPEVM CLQTLNFGNG FALALCLSFL PNTDVPILAC 1 SO 
GNDDCPJHIF AQQNDQFQKV LSLCGHEDWI RGVEWAAFGR DLFLASCSQD CLIRIWKLYI 240 
KSTSLETQDD DNIRLKENTF TIENESVKIA FAVTLETVLA GHENWVNAVH WQPVFYKDGV 300 
LQQPVRLLSA SMDKTMILWA PDEESGVWLE QVRVGEVGGN TLGFYDCQFN EDGSMIIAHA 360 
FHG ALHLWKQ NTVNPREWTP EIVISGHFDG VQDLVWDPEG EFI1TVGTDQ TTRLFAPWKR 420 
KDQSQVTWHE IARPQIHGYD LKCLAMIfTRF QFVSGADEKV LRVFSAPRNF VENFCAITGQ 480 
SLNHVLCNQD SDLPEGATVP ALGLSNKAVF QGDIASQPSD EEELLTSTGF EYQQVAFQPS 540 
ILTEPPTEDH LLQNTLWPEV QKLYGHGYEI FCVTCNSSKT LLASACKAAK KEHAAI1LWN 600 
TTSWKQVQNL VFHSLTVTQM AFSPNEKFLL AVSRDRTWSL WKKQDTISPE FEP VFSLFAF 660 
TNK1TSVHSR 1IWSCDWSPD SKYFFTGSRD KKVVVWGECD STDDCIEHNI GPCSSVLDVG 720 
GAVTAVSVCP VLHPSQRYVV AVGLECGKIC 1ATWKKTDQV PEINDWTHCV ETSQSQSHTL 780 
AIRKLCWKNC SGKTEQKhAB GAEWLHFASC GKDHTVKIHR VNKCAL 

Seq ID NO: 35 DNA sequence 

Nucleic Add Accession #: NM 02213 1 

Coding sequence: 1 1 -2878 (underlined sequences correspond to start and stop codons) 

t 11 21 31 41 51 

TGCTGCGAGG ATGCTGCCTG GGCGGCTGTG CTGGGTGCCG CTCCTGCTGG CGCTGGGCGT 60 
GGGGAGCGGC AGCGGCGGTG GCGGGGACAG CCGGCAGCGC CGCCTCCTCG CGGCTAAAGT 120 
CAATAAGCAC AAGCCATGGA TCGAGACTTC ATATCATGGA GTCATAACTG AG AACA ATGA 1 80 
CACAGTCATT TTGGACCCAC CACTGGTAGC CCTGGATAAA GATGCACCGG TTCCTFTTGC 240 
AGGGGAAATC TGTGCGTTCA AGATCCATGG CCAGGAGCTG CCCTTTGAGG CTGTGGTGCT 300 
CAACAAGACA TCAGGAGAGG GCCGGCTCCG TGCCAAGAGC CCCATTGACT GTGAGTTGCA 360 
GAAGGAGTAC ACATTCATCA TCCAGGCCTA TGACTGTOGT GCTGGGCCCC ACGAGACAGC 420 
CTGGAAAAAG TCACACAAGG CCGTGGTCCA TATACAGGTG AABGAWiX-'A ACGAGTTTGC 480 
TCCCACCTTC AAAGAGCCAG CCTACAAGGC TGTTGTGACG GAGGGCAAGA TCTATGACAG S40 
CATTCTGCAG GTGGAGGCCA TTGACGAGGA CTGCTCCCCA CAGT ACAGCC AGATCTGCAA 600 
CTATGAAATC GTCACCACAG ATGTGCCTTT TGCCATCGAC AGAAATGGCA ACATCAGGAA 660 
CACTGAGAAG CTGAGCTATG ACAAACAACA CCAGTATGAG ATCCTGGTGA CCGCCTACGA 720 
CTGTGGACAG AAGCCCGCTG CTCAGGACAC CCTGGTGCAG GTGGATGTGA AGCCAGTTTG 780 
CAAGCCTGGC TGGCAAGACT GGACCAAGAG GATTGAGTAC CAGCCTGGCT CCGGGAGCAT 840 
GCCCCTGTTC CCCAGCATCC ACCTGGAGAC GTGCGATGGA GCCGTGTCTT CCCTCCAGAT 900 
CGTCACAGAG CTGCAGACTA ATTACATTGG GAAGGGTTGT GACCGGGAGA CCTACTCTGA 960 
G A AATCCCTT CAGAAGTTAT GTGG ACCCTC CTCTGGC ATC ATTGACCTCT TGCCATCCCC 1 020 
TAGCGCTGCC ACCAACTGGA CTGCAGGACT GCTGGTGGAC AGCAGTGAGA TGATCTTCAA 1080 
GTTTG ACGGC AGGCAGGGTG CCAAAATCCC CGATGGGATT GTGCCCAAG A ACCTGACCGA 1140 
TCAGTTCACC ATCACCATGT GGATGAAACA CGGCCCCAGC CCTGGTGTGA GAGCCGAGAA 1200 
GGAAACCATC CTCTGCAACT CAGACAAAAC CGAAATGAAC CGGCATCACT ATGCCCTGTA 1260 
TGTGCACAAC TGCCGCCTCG TCTTTCTCTT GCGGA AGGAC TTCGACCAGG CTGACACCTT 1320 
TCGCCCCGCG GAGTTCCACT GGAAGCTGGA TCAGATTTGT GACAAAGAGT GGCACTACTA 1380 
TGTCATCAAT GTGGAGTTTC CTGTGGTAAC CTTATACATG GATGGAGCAA CATATGAACC 1440 
ATACCTGGTG ACCAACGACT GGCCCATTCA TCCATCTCAC ATAGCCATGC AACTCACAGT 1500 
CGGCGCTIGT TGGCAAGGAG GAGAAGTCAC CAAACCACAG TTTGCTCAUT TCTTI'CATGG 1 560 
AAGCCTGGCC AGTCfCACCA TCCGCCCTGG CAAAATGGAA AGCCAOAAGG TGATCTCCTG 1620 
CCTGCAGGCC TGCAAGGAAG GGCTGGACAT TAATTCCTTG GAAAGCCTTG GCCAAGGAAT 1680 
AAAGTATCAC TTCAACCCCT CGCAGTCCAT CCTGGTGATG GAAGGTGACG ACATTGGGAA 1740 
CATTAACCGT GCTCTCCAG A A AGTCTCCTA CATC A ACTCC AGGCAGTTCC CA ACGGCGGG 1 S00 
TGTGCGGCGC CTCAAAGTAT CCTCCAAAGT CCAGTGCTTT GGGGAAGACG TATGCATCAG 1 860 
TATCCCTG AG GTAGATGCCT ATGTGATGGT CCTGCAGGCC ATCG AGCCCC GGATCACCCT 1 920 
CCGGGGCACA GACCACTTCT GG AGACCTGC TGCCCAGTTT GAAAGTGCCA GGGGAGTGAC 1980 
CCTCTTCCCT G AT ATCAAG A TTGTG AGCAC CTTCGCCAAA ACCGAAGCCC CCGGGGACGT 2040 
GAAAACCACA GACCCCAAAT CAGAAGTCTT AGAGGAAATG CTTCATAACT TAGATTTCTG 2100 
TGACATTTTG GTGATCGGAG GGGACTTGGA CCCAAGGCAG GAGTGCTTGG AGCTCAACCA 2160 
CAGTGAGCTC CACCAACGAC ACCTGGATGC CACTAATTCT ACTGCAGGCT ACTCCATCTA 2220 
CGGTGTGGGC TCCATGAGCC GCTATGAGCA GGTGCTACAT CACATCCGCT ACCGCAACTG 2280 
GCGTCCGGCT TCCCTTGAGG CCCGGCGTTT CCGGATTAAG TGCTCAGAAC TCAATGGGCG 2340 
CTACACTAGC AATGAGTTCA ACTTGGAGGT CAGCATCCTT CATG AAGACC AAGTCTCAGA 2400 
TAAGGAGCAT GTCAATCATC TGATTGTGCA GCCTCCCTTC CTCCAGTCTG TCCATCATCC 2460 
TGAGTCCCGG AGTAGCATCC AGCACAGTTC AGTGGTCCCA AGCATTGCCA CAGTGGTCAT 2520 
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CATCATCTCC GTGTGCATGC TTGTGTTTGT CGTGGCCATG GGTGTGTACC GGGTCCGGAT 2580 
CGCCCACCAG CACTTCATCC AGGAGACTGA GGCTGCCAAG G AATCTG AGA TGGACTGGGA 2640 
CGATTCTGCG CTGACTATCA CAGTCAACCC CATGGAGAAA CATGAAGGAC CAGGGCATGG 2700 
GGAAGATGAG ACTGAGGGAG AAGAGGAGGA AGAAGCCGAG GAAGAAATGA GCTCCAGCAG 2760 
5 TGGCTCTGAC GACAGCGAAG AGGAGGAGGA GGAGGAAGGG ATGGGCAGAG GCAGACATGG 2820 
GCAGAATGGA GCCAGGCAAG CCCAGCTGGA OTGGGATGAC TCCACCCTCC CCTACTAGTG 2SB0 
CCCAGGGGTC TGCTGCCTGG CCCACATGTC CCTTTTGTAA ACCCTGACCC AGTGTATGCC 2940 
CATGTCTATC ATACCTCACC TCTGATGTCT GTGACATGTC TGGGAAGGCC TTCTCCAGCT 3000 
TCCTGGAGCC CACCCTTTAA GCCTTGGGCA CTCCCTGTGT TTCATCCATG GGGAAGTTCC 3060 

1 0 ' AAGAAGCCCA GCATGGCCAT CAGTGAGGAC TTCAGGGTAG ACTTTGTCCT GTAGCCTCCA 3120 
CTTCTGCCCT AAGTTCCCCA GCATCCTGAC TAC'CTGTC'L'G CAGAGTTTGC CTTTGTTnT 3 1 30 
TCCTGCAGGG AAGAAGUCCC ACC H iG ! UT CACTCACCTC CCCAGGCTCA GAGTCCCCAA 3240 
GGCCCTGGGG TTCCAACTCA CTGTGCGTCT CCTCCACACA GACCAGTAGG TTCTCCTATG 3300 
CTGACTCCAG GTTGCTTCAT ACAAGGAGGG TGGTTGAACT TCACACACGT AAGGTCTTAG 3360 

1 5 TGCTTAACAG TTTAAAGGAA AGTCCITGTT GAGGCAGAAC TAAGTTTACA GGGAAAGGTA 3420 
CACACATTCT CTCTCTCTCT CTCTCTCTGT CTATCTAGTT CCCCAGCTTG GAGAGCCTTT 3480 
CCCCTTGCTT CTTTCTGAGG CCATATAAGC TTATAAGAAA AGTCCCAAAC CAAGAATAGG 3540 
TCCTTGGCC A C AAGCAGGGT CTGATCCCCC ATCAGAGCTA TCTGAGCCTG CCTGTCTGGG 3600 
CACCTGCTGC AACCATGCAG CTACCCTGCC AGGGGCACTC AGCAAACAGA ACCACAGGGC 3660 

20 CCAGGAGGCA TTCCACACAG GCACTGCCCC AGGACAACAC AACAAGGACA GTCACAACAA 3720 
GGACAACAAG GACACAACAC AACACACAAC AAGGACAGTC ACAACAAGCC TAGAGCCAGA 3780 
AAGCAGATGG AAATGCTAAT GAGGTCAAAC GTAGGCTTCA TGGTGGGTGG AGTGGGGGTG 3340 
GCTGGGCTCC CCCAGGACAG AGGGGACCCT GAGGTTGGCA AGGCTCTCAC CACTCAGCCT 3900 
TATGGTCCCT TATCTCCTAT CTTCCCTCTT GAGAAAATAC ACGCTTTCTG CATGTATTAG 3960 

25 AA ACGCACG A GCTCCACC AA GTCTACAATG AAAGTTTG A A ATTTAACTGC AAGG AATTAG 4020 
AAGCATATTT GCAATCATTG CAGCTTCTTC TTTCTTCTGG TCATAAAAGG AGGAACACTT 4080 
TAGATAGAGG GCAAATATAT CTGAAAACCT AATTTCTTTC TTJTTTTG AT A AGG AAATCT 4140 
TTTCCATCJTC CA1 CCTAACA TGCACAACCT GTGAAGAGA A TTGTTTCTAT AGTAACTGGT 42C0 
CTGTOATCTT ITG'IGGCCAA GAGAATAGCA GGCAAGAATT AGGGCC1TGA CAGAA'ITTCC 4260 

3 0 ACGAAGCTCT GAGAACATGT TTGTTTCGAA TGTCTGATTC CTCTTTGTCA TCAATGTGTA 4320 
TGCTCTGTCC CCATCCTTCA CTCCTCCTCA AGCTCACACC AATTGGTTTG GCACAGGCAC 4380 
AGAGCTGGTC CCTAGTTAAG TGGCATTTAT GTTAAAAAAA AATAGTTCAG AATCTCAGCC 4440 
TTTTCTTTGT GTCATCAAAA CAGCTTAAGA AGGGGACTAC TGCCAATGTC CTCTAGTCTG 450O 
ACCTCCACCC AGGGAGGACC CATGGCAGGT CTTTTCAACT TTCTGATTCA TGAGAACAAC 4560 

3 5 CTTGTG A AGC TTTTCCCACC TCCTAAAGTG TTTTCTGCAT CTGTTCCTTC CTTTGGACCT 4620 

CACAACAAAT CCTGTGAAGT AACTGAGACA TCTGTTGTTA GATACATTTT TGTGATGAGT 4680 
AAACTGAGGC TTCG 

■ Seq ID NO: 36 Protein sequence; 
40 Protein Accession* NP_071414.t 

1 11 21 31 41 51 
I 1 I I I I 

Ml PGRLCWW ULALGVGSG SGGGGDSRQR RLLAAKVNKH KPWiUTSYHH VtTENNDTVI 60 
45 LDPPLVALDK DAPVPFAGEI CAFKIHGQEL PFEAVVLNKT SGEUUI.RAKK 1'IUCELQKEY 120 
TFJIQAYDCG AGPHETAWKK SHKAWHIQV KDVNEFAPTF KDI'AYK AWT BGKIYDS1LQ 180 
VEAIDEDCSP QYSQICNYEI VTTDVPFAiD RNGN1RNTEK LSYDKQHQYE ILVTAYDCGQ 240 
KPAAQDTLVQ VDVKPVCKPG WQDWTKRIEY QPGSGSMPLF FSIHLETCDG AVSSLQIVTE 300 
LQTNYIGKGC DRETYSEKSL QKLCGASSGI IDLLPSPSAA TNWTAGLLVD SSEMIFKFDG 360 
50 RQG AKIPDGI VPKNLTDQFT ITMWMKHGPS PGVRAEKETI LCNSDKTEMN RHHY ALYVHN 420 
CRLVFLLRKD FDQADTFRFA EFHWKLDQIC DKEWHYYVIN VEFFVVTLYM DGATYEPYLV 480 
TNDWPIHPSH IAMQLTVGAC WQGGEVTKPQ FAQFFHGSLA SLTffiPGKME SQKVISCLQA 540 
CKEGL.DINSL ESLGQGDCYH FNPSQS1LVM EGDDIGNINR ALQK.VSYINS RQFPTAG VRR 600 
LKVSSKVQCF GED VCISIPE VDAYVMVLQA 1EPR1TLRGT DHFWRPAAQF ESARGVTLFP 660 
5 5 DIKIVSTFAK. TEAPGDVKTT DPKSEVLEEM LHNLQFCDE. VIGGDLDPRQ ECLELNHSEL 720 

HQRHLDATNS TAGYSIYGVG SMSRYEQVLH HIRYRNWRPA SLEARRFRK CSELNGRYTS 780 
NEFNLEVSIL HEDQVSDKEH VNHLIVQPPF LQSVHHPESR SSIQHSSVVP SIATVVIflS B40 
VCMLVFVVAM GVYRVRIAHQ HFIQETEAAK ESEMDWDDSA LT1TVNPMEK HEGPGHGEDE 900 
TEGBEEEEAE EEMSSSSGSD DSEEEEEEEG MGRGRHGQNG ARQAQLEWDD STLPY 

60 

Seq ID NO: 37 DNA sequence 

Nudeic Acid Accession ft none Found 

Coding sequence: ] 43-874 (underlined sequences correspond tD slari and stop codons) 

65 1 11 21 31 41 51 
I I I I I I 

GGG AGG GAGA GAGGCGCGCG GGTGAAAGGC GCATTGATGC AGCCTGCGGC GGCCTCGGAG 60 
CGCGGCGGAG CCAGACGCTG ACCACGTTCC TCTCCTCGGT CTCCTCCGCC TCCAGCTCCG 120 
CGCTGCCCGG CAGCCGGGAG CCATGCGACC CCAGGGCCCC GCCGCCTCCC CGCAGCGGCT ISO 

7 0 CCGCGGCCTC CTGCTGCTCC TGCTGCTGCA GCTGCCCGCG CCDTCGAGCG CCTCTGAGAT 240 

CCCCAAGGGG AAGCAAAAGG CGCAGCTCCG GCAGAGGGAG GTGGTGGACC TGTATAATGG 300 
AATGTGCTTA CAAGGGCCAG CAGGAGTGCC TG GTCGAG AC GGGAGCCCTG GGGCCAATGG 360 
CATTCCGGGT ACACCTGGGA TCCCAGGTCG GGATGGATTC AAAGGAGAAA AGGGGGAATG 420 
TCTGAGGGAA AGCTTTGAGG AGTCCTGG AC ACCCAACTAC AAGCAGTGTT CATGGAGTTC 480 

75 ATTGAATTAT GGCATAGATC TTGGGAAAAT TGCGGAGTGT ACATTTACAA AGATGCGTTC 540 
AAATAGTGCT CTAAG AGTTT TGTTCAGTGG CTCACTTCGG CTAAAATGCA GA AATGCATG 600 
CTGTCAGCGT TGGTATTTCA CATTCAATGG AGCTGAATGT TCAGGACCTC TTCCCATTGA 660 
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AGCTATAATT tatttgg acc a aggaagccc tg a AATGAAT TCAACAATTA ATATTCATCG 720 
CACTTCTTCT GTGGAAGGAC TTTGTG A AGG AATTGGTGCT GGATTAGTGG ATGTTGCTAT 780 
CTGGGTTGGC ACTTGTTCAG ATTACCCA AA AGGAGATGCT TCTACTGGAT GGAATTCAGT 840 
TTCTCGCATC ATTATTG AAG AACTACCAAA ATAAATGCTT TAATTTTCAT TTGCTACCTC 900 
TTTTTTrATT ATGCCTTGGA ATGGTTCACT TAAATG ACAT TTTAAATA AG TTTATGTATA 960 
CATCTGAATG AAAAGCAAAG CTAAATATGT TTACAGACCA AAGTGTGATT TCACACTGTT 1020 
TTTAAATCTA GCATTATTCA TTTTGCTTCA ATCAAAAGTG GTTTCAATAT TTTTTTTAGT 1080 
TGGTTAGAAT ACTTTCTTCA TAGTCACATT CTCTCAACCT ATAATTTGGA ATATTGTTGT 1 140 
GGTCTTTTGT TTTTTCTCTT AGTATAGCAT TTTTAAAAAA ATATAAAAGC TACCAATCTT 1200 
TGTACAATTT GTAAATGTTA AG AATTTTTT TTATATCTGT TAAATAAAAA TTATTTCCAA 1260 
CAACCTTAAA AAAAAAAAAA AAAA 



MRPQGPAASP QRLRGLLLLL LLQLPAPSSA SEIPKGKQKA QLRQREVVDL YNGMCLQGPA 60 
GVPGRDGSPG ANG1PGTPGI PGRDGFKGEK GECLRESFEE SWTPNYKQCS WSSLNYGIDL 120 
20 GKIAECTFTK MRSNSALRVL FSGSLRLKCR NACCQRWYFT FNGAECSGPL FIEAIIYLDQ 180 
GSPEMNSTIN IHRTSSVEGL CEGIGAGLVD VAIWVGTCSD YPKGDASTGW NSVSRI1IEE 240 
LPK 

Seq ID NO: 39 DNA sequence 
25 Nucleic Acid Accession*: NM 000949 

Coding sequence: 285-2 1 53(underlined sequences con-espond to start and stop codons) 

1 11 21 31 41 51 

30 . Uag'gctgaaatccccagacgccggttttctgggctgggctttctgcttactcactcctt 60 

CTCCCTCTTT CTGGATTTTA CCGACCGTTC GCGAAACAGC TTTCCACACA ATGGAGCTTC 120 
ATGTCCTCGT GCAGGAAGTA CTCATCGACT GATGTGGCAG ACTTTGCTCC CTGACAAAAC 1 80 
TAAAGAACTC TCCTATTCAT GGAGGCGAAC ACTGAGGATG CTTTCCACAT GAACCCTGAA 240 
□TGAACTTCT GATACATTTC CTGCAGCAAG AGAAGGCAGC CAACATGAAG GAAAATGTGG 300 

35 CATCTGCAAC CGTTTTCACT CTGCTACTTT TTCTCAACAC CTGCCTTCTG AATGGACAGT 360 
TACCTCCTGG AAAACCTGAG ATCTTTAAAT GTCGTTCTCC CAATAAGGAA ACATTCACCT 420 
GCTGGTGGAG GCCTGGGACA GATGGAGGAC TTCCTACCAA TTATTCACTG ACTTACCACA 480 
GGGAAGGAGA GACACTCATG CATGAATGTC CAGACTACAT AACCGGTGGC CCCAACTCCT 540 
GCCACTTTGG CAAGCAGTAC ACCTCCATGT GGAGGACATA CATCATGATG GTCAATGCCA 600 

40 CTAACCAGAT GGGAAGCAGT TTCTCGGATG AACTTTATGT GGACGTGACT TACATAGTTC 660 
AGCCAGACCC TCCTTTGG AG CTGGCTGTGG AAGTAA AACA GCCAGA AGAC AG AAAACCCT 720 
ACCTGTGGAT TAAATGGTCT CCACCTACCC TGATTGACTT AAAAACTGGT TGGTTCACGC 7S0 
TCCTGTATGA AATTCGATTA AAACCCGAGA AAGCAGCTGA GTGGGAGATC CATTTTGCTG 840 
GGCAGCAAAC AGAGTTTAAG AT I CTCAGCC TACATCCAGG ACAGAAATAC CTTGTCCAGC 900 

45 i T rG< iCAAACCA CCATCG I ACTGGA GTGCATGGAG I'D X K , U T M1UGA SCO 
TACCTAGTGA CTTCACCATG AATGATACAA CCGTGTGG AT CTCTGTGGCT GTCCTTTCTG 1 020 
CTGTCATCTG TTTGATTATT GTCTGGGCAG TGGCTTTGAA GGGCTATAGC ATGGTGACCT 1080 
GCATCTTTCC GCCAGTTCCT GGGCCAAAAA TAAAAGOATT TGATGCTCAT CTGTTGGAGA 1140 
AGGGC AAGTC TGAAGAACTA CTGAGTGCCT TGGG ATGCCA AGACTTTCCT CCCACTTCTG 1 200 

5 0 ACTATGAGG A CTTGCTGGTG G AGTATTTAG AAGTAGATGA TAGTGAGGAC CAGCATCTAA 1 260 
TGTCAGTCCA TTCAAAAGAA CACCCAAGTC AAGGTATGAA ACCCACATAC CTGGATCCTG 1320 
ACACTGACTC AGGCCGGGGG AGCTGTGACA GCCCTTCCCT TTTGTCTGAA AAGTGTGAGG 1380 
A ACCCCAGGC CAATCCCTCG ACATTCTATG ATCCTGAGGT CATTGAGAAG CCAGAGAATC 1 440 
CTGAAACAAC CCACACCTGG GACCCCCAGT GCATAAGCAT GGAAGGCAAA ATCCCCTATT 1500 

5 5 TTCATGCTGG TGG ATCCAAA TGTTCAACAT GGCCCTTACC ACAGCCCAGC CAGCACAACG 1 560 
CCAGATCCTC TTACCACAAT ATTACTGATG TGTGTGAGCT GGCTGTGGGC CCTGCAGGTG 1620 
CACCGGCCAC TCTGTTGAAT GAAGCAGGTA AAGATGCTTT AAAATCCTCT CAAACCATTA 1 680 
AGTCTAGAGA AGAGGGAAAG GCAACCCAGC AGAGGGAGGT AGAAAGCTTC CATTCTGAGA 1740 
CTGACCAGGA TACGCCCTGG CTGCTGCCCC AGGAGAAAAC CCCCTTTGGC TCCGCTAAAC 1 800 

60 CCTTGGATTA TGTGGAGATT CACAAGGTCA ACAAAGATGG TGCATTATCA TTGCTACCAA 1860 

AACAGAGAGA GAACAGCGGC AAGCCCAAGA AGCCCGGGAC TCCTGAGAAC AATAAGGAGT JWO 

fGCCA CCGGGGTC A'l T ICCTI H'TGGTI i OATCCACATO 19» 

CTAAAAACGT GGCTTGCTTT GAAGAATCAG CCAAAGAGGC CCCACCATCA CTTGAACAGA 2040 
ATCAAGCTGA GAAAGCCCTG GCCAACTTCA CTGCAACATC AAGCAAGTGC AGGCTCCAGC 2100 

65 TGGGTGGTTT GGATTACCTG GATCCCGCAT GTTTTACACA CTCCTTTCAC TGATAGCTTG 2160 
ACTAATGGAA TGATTGGTTA AAATGTGATT TTTCTTCAGG TAACACTACA GAGTACGTGA 2220 
AATGCTCAAG AATGTAGTCA GACTGACACT ACTAAAGCTC CCAGCTCCTT TCATGCTCCA 2280 
TTTTTAACCA CTTGCCTCTT TCTCCAGCAG CTGATTCCAG AACAAATCAT TATGTTTCCT 2340 
AACTGTGATT TGTAGATTTA CTTTTTGCTG TTAGTTATAA AACTATGTGT TCAATGAAAT 2400 

70 AAAAGCACAC TGCTTAGTAT TCTTGAGGGA CAATGCCAAT AGGTATATCC TCTGGAAAAG 2460 
QCTTTCATGA TTTGGCATGG GACAGACGGA AATGAAATTG TCAAAATTGT TTACCATAGA 2520 
AAGATGACAA AAGAAAATTT TCCACATAGG AAAATGCCAT GAAAATTGCT TTTGAAAAAC 2SB0 
AACTGCATA A CCTTTACACT CCTCGTCCAT TTTATTAGG A TTACCCAAAT ATA ACCATTT 2640 
AAAOAAAGAA TGCATTCCAG AACAAATTGT TTACATAAGT TCCTATACCT TACTGACACA 2700 

75 TTGCTGATAT GCAAGTAAGA AAT 

Seq ID NO: 40 ; 



WO 02/059377 



Protein Accession #: NP„0O094D,l 
I 11 21 31 41 51 

MSCENVAS ATV FTLLLFLNTC LLNGQLPPGK PEIFKCRSPN KETFTCWWRP GTDGGLPTNY 60 
SLTYHREGET LMHECPDY1T GGPNSCHFGK QYTSMWRTYI MMVNATNQMG SSFSDELYVD 120 
VTYIVQPDPP LELAVEVKQP EDRKPYLWIK WSPPTLIDLK TGWFTLLYEI RLKPEKAAEW 1 80 
E1HFAGQQTE FKILSLHPGQ KYLVQVRCKP DHGYWSAWSP ATFIQIPSDF TMNDTTVWIS 240 
VAVLSAVICL UVWAVALKG YSMVTC1FPP VPGPKIXGFD AHLLEKGKSE ELLSALGCQD 300 
FPPTSDYEDL LVEYLEVDDS EDQHLMSVHS KEHPSQGMKP TYLDPDTDSG RGSCDSPSLL 360 
SEKCEEPOAN PSTFYDPEVI EKPENPETTH TWDPQCISME GK1PYFHAGG SKCSTWPLPQ 420 
P5QHNPRSSY HN1TDVCELA VGPAGAPATL LNEAGKDALK SSQT1KSREE GKATQQREVP, 480 
SFHSETDQDT PWLLPQEKTP FGSAKPLDYV E1HKVNKDGA LSLLPKQREN SGKPKKPGTP 540 
ENNKEYAKVS GVMDNNILVL VPDPHAKNVA CFEESAKEAP PSLEQNQAEK ALANFTATSS 600 
KCRLQLGGLD YLDPACFTHS FH 

Seq ID NO: 41 DNA sequence 

Nucleic Acid Accession it; none found, Eos cloned sequence 

Coding sequence: 1-1572 (underlined sequences correspond to start and stop codons) 

! 11 21 3! 41 51 

ATOACCCAAA ATAAATTAAA GCTITGTTCC AAAGCCAATG TGTATACTGA AGTGCCTGAT 60 
GGAGGATGGG GCTGGGCGGT AGCTGTTTCA TTTTTCTTCG TTGAAGTCTT CACCTACGGC 120 
ATCATCAAGA CATTTGGTGT CTTCTTTAAT GACTTAATGG ACAGTTTTAA TGAATCCAAT 180 
AGCAGG ATCT CATGGATAAT CTCA ATCTGT GTGTTTGTCT TAACATTTTC AGCTCCCCTC 240 
GCCACAGTCC TGAGCAATCG TTTCGGACAC CGTCTGGTAG TGATGTTGCi({ GGGGCTACTT 300 
GTCAGCACCG GGATGGTGGC CGCCTCCTTC TCACAAGAGG TTTCTCATAT GTACGTCGCC 360 

GCATCATCTCT rCTGGGATACl [TTAGTTTTC TCC \CTGT A f T<" T at 
TCACAATATT TTGGCAAAAG ACGTTCCATA GTCACTGCAG TTGCTTCCAC AGGAGAATGT 4S0 
TTCGCTGTGT TTGCTTTCGC ACCAGCAATC ATGGCTCTGA AGGAGCGCAT TGGCTGGAGA 540 
TACAGCCTCC TCTTCGTGGG CCTACTACAG TTAAACATTG TCATCTTCGG AGCACTGCTC 600 
AGACCCATCT TTATCAGAGG ACCAGCGTCA CCGAAAATAG TCATCCAGGA AAATCGGAAA 660 
GAAGCGCAGT ATATGCTTGA AAATGAGAAA ACACGAACCT CAATAGACTC CATTGACTCA 720 
GGAGTAGAAC TAACTACCTC ACGTA A AAAT GTGCCTACTC ACACTAACCT GGAACTGGAG 780 
CCGAAGGCCG ACATGCAGCA GGTCCTGGTG AAGACCAGCC CCAGGCCAAG CGAAAAGAAA MO 
GCCCCGCTAT TAGACTTCTC CATTTTGAAA G AGAAAAGTT TTATTTGTTA TGCATTATTT 900 
GGTCTGTTTG CAACACTGGG ATTCTTTGCA CCTTCCTTGT ACATCATTCC TCTGGGCATT 960 
AGTCTGGGCA TTGACCAGGA CCGCGCTGCT TTTTTATTAT CTACGATGGC CATTGCAG AA 1 020 
GTTTTCGGAA GGATCGGAGC TGGTTTTGTC CTCAACAGGG AGCCCATTCG TAAGATTTAC 1080 
ATTGAGCTCA TCTGCGTCAT CTTATTGACT GTGTCTCTGT TTGCCTTTAC TTTTGCTACT 1 140 
GAATTCTGGG GTCTAATGTC ATGCAGCATA TTTTTTGGGT TTATGGTTGG AACAATAGGA 1200 
GGGACTCACA TTCCACTGCT TGCTGAGGAT GATGTCGTGG GCATTGAGAA GATGTCTTCT 1260 
GCAGCTGGGG TCTACATCTT CA'ITCAGAGC ATAGCAGGAC TGGCTGGACC GCCCCTTGCA 1320 
GGTTTGTTGG TGGACCAA AO TAAOATCTAC AGCAGGGCCT TCTACTCCTG CGCAGCTGGC £380 
ATGGCCCTGG CTGCTGTGTG 1 I i I i GTGAGACCGT GTAAGATGGG ACTGTGCCAG 1440 
CATCATCACT CAGGTGAAAC AAAGGTAGTG AGCCATCGTG GGAAGACTTT ACAGGACATA 1500 
CCTGAAGACT TTCTGGAAAT GGATCTTGCA AAAAATGAGC ACAGAGTTCA CGTGCAAATG 1560 
GAGCCGGTAT GA 

Seq iD NO; 42 Protein sentence: 

Protein Accession #: none found, Eos cloned sequence 

1 11 21 31 41 51 

I I I i 1 ! 

MTQNKLKLCS KANVYTEVPD GGWGWAVAVS FFFVEVFTYG IIKTFGVFFN DLMDSFNESN 60 
SRJSWIISIC VFVLTFSAPL ATVLSNRFGH FXVVMLGGLL VSTGMVAASF SQEVSHMYVA 120 
IGHSGLGYC FSFLPT VTIL SQYFGKRRSI VTAV ASTGEC FAVFAFAPAI MALKERIGWR 1 80 
YSLLFVGLLQ LNIV1FGALL RFIFIRGPAS PKIVIQENRK EAQYMLENEK TRTS1DS1DS 240 
GVELTTSPKN WTHTNLELE PKADMQQVLV KTSPRPSEKK APIXDFSILK EKSFICYALF 300 
GLFATf.GFFA PSLY 1 J L I 1 1 LLSTM E P iGFV 1,1 PIRKI 
i! I \i I iir U 1 IS s j 1 I u LAED DVVGIEKjM 

AAGVYIFfQS IAGLAGPPlA GLLVDQSKIY SRAFYSCAAG MALAAVCLAL VRPCKMGLCQ 480 
HHHSGETKVV SFIRGKTLQDI PEDFLEMDLA KNEHRVHVQM EPV 

Seq ID NO: 43 DNA sequence 

Nucleic Acid Accession ft FGENESH predicted ORF 

Coding sequence: 1-1749 (underlined sequences correspond to start and stop codons) 

1 II 21 31 41 51 

ATGCTGTCTG GCTTCTTGAT GAGTCCCAGT ACCCAGCACA GAGCACAGTA CACTCCCGGA 60 
GGAAAGAAAC TTCCGTGGGA GGCTTCCATC GGTGCGCACA CCTCCCGAGG GCGAGGCAGC 120 
GACCGGGAGA GGGAGAGCCG GCCGGAGGCT GCCGGGCTCC TGTGGGACCG CGCTGCAGCC 180 
GGGGAGGCGG AGAAGGGGAA CCGGGGCGAG CCGCCCGCCT GGATCCGCGC CCAGCAGCAG 240 
CCGCGGCCGC CGCCAGCTGG GCAGGCTCCC GGG ACTGCGG CTGGGG GCGC GCAGG ACCCT 300 
CGCCTGCGTC CTGGACGTTC CCGGGGGAGG GTCCGGTTGC CAGTGAAACC TCCAGAGGCT 360 
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TCCGGACGAC AOCCCCCGGG GCCTTCTGAC TGCATCCCGA GATTTCCATC AGCGAGTGCA 420 
ACTCATAAGG CAGTCCCTAA GGGGACCGGG CCACCGGCTG AGGACGGGGA TGGCTTAGGA 480 
GCTCCTGGAC CTAGGGCCCG GCGTCGTCGC CTCCTGGGCG TCGCGGCAGA GGGGAGTGGC 540 
CCGCGCGGAA AG CGCCGCGG GACAGTCAGT G ACGAGGCCC GGGGGTCGCC GGGGCCACGA 600 
CTTCTCGGAG ACCGTCCTGC GCTCTCTGGA GACGCGCTGT CCGCGCCCAG GGTGGTGCCA 660 
TGTGGGGCGC TCGCCGCTCG TCCGTCTCCT CATCCTGGAA CGCCGCTTCG CTCCTGCAGC 720 
TGCTGCTGGC TGCGCTGCTG GCGGCGGGGG CGAGGGCCCA GCGGCGAGTA CTGCCACGGC 780 
TGGCTGGACG CGCAGGGCGT CTGGCGCATC GGCTTCCAGT GTCCCGAGCG CTTCG ACGGC 840 
GGCGACGCCA CCATCTGCTG CGGCAGCTGC GCGTTGCGCT ACTGCTGCTC CAGCGCCGAG 900 
GCGCGCCTGG ACCAGGGCGG CTGCGACAAT GACCGCCAGC AGGGCGCTGG CGAGCCTGGC 960 
CCCGCGOACA AAGACGGGCC CCGACGGCTO GGCAGGGCTT CAT6TCTTAG GGOTACCCAA 1020 
GGAGACGGCG AGGGTGCGCC CCCACCCGTG AGGGCCTGGC AGCGGTGCTC CCCTGAACGC 10S0 
TCCCCGAAAG GAAGGCAGCT CCTCAGGGCT TTCCCGGGGC TGCTGCCCCG TGCCAGACGC 1 J4D 
CGCGGATTCC CATCTTCTCC ACGCGGCGGC CCCTCTCCCC TGCAGCGGCC CGCCTTGCCC 1200 
ATCrACGTGC CGTTCCTCAT TGTTGGCTCC GTGTTTGTCG CCTTTATCAT CTTGGGGTCC 1260 
CTGGTGGCAG CCTGTTGCTG CAGATGTCTC CGGCCTAAGC AGGATCCCCA GCAGAGCCGA 1320 
GCCCCAGGGG GTAACCGCTT GATGGAGACC ATCCCCATGA TCCCCAGTGC CAGCACCTCC 1380 
CGGGGGTCGT CCTCACGCCA GTCCAGCACA GCTGCCAGTT CCAGCTCCAG CGCCAACTCC 1440 
GGGGCCCGGG CGCCCCCAAC AAGGTCACAG ACCAACTGTT GCTTGCCGGA AGGGACCATG 1500 
AACAACGTGT ATGTCAACAT GCCCACGAAT TTCTCTGTGC TGAACTGTCA GCAGGCCACC 1S60 
CAGATTGTGC CACATCAAGG GCAGTATCTG CATCCCCCAT ACGTGGGGTA CACGGTGCAG 1620 
CACGACTCTG TGCCCATGAC AGCTGTGCCA CCTTTCATGG ACGGCCTGCA GCCTGGCTAC 1680 
AGGCAGATTC AGTCCCCCTT CCCTCACACG AACAGTGAAC AGAAGATGTA CCCAGCGGTG 1740 
ACTGTATAAC CGAGAGTCAC TGGTQGGTTC CTTTACTG AA GGGAGACGAA GGCAGGGGTG 1 800 
GATTCTCGAG GTGGAAGTCC GCACATGTCG GTGGTATTTA TGGCACGATT CCTTTGGATG I860 
GCTTCATTTG CCCCCAG ACT GTATG A AAAC ATCTCCGA AT TAGCATTTCT GGATATGTTT 1 920 
CATCCAGGGT ATCATTGATT TATGATGGAA AACCGGCCTC AGCTGGAGAT GACTGTGATG 1980 
ITGCTGATGG GTGTATAACA AATGCTTGAG TCCGAAGTGC CCTTGAGATA TGG'ITGACGA 2040 
AAGAATT IT A TAAACTGATA AATTAAGGAT TTTTATTATG TTGTTATf AT TATTTCTnT 2 1 00 
TTGTTGTTGA CTGCACAGGA TCAAAATGCC TGTTATCTCC CTTTTACTGG GACTTTTTTT 2160 
TTTTTTTTTT TTTTTTTTAA TCAGACAGGG TCTTGCTCTG TTGCCCAGGC TGGAGTGCAG 2220 
TGGTGCGATC TCGGCTCACT GCAACTTCAG CCTCCTGG AT TCAGGCAACA CTCCTGCGTC 2280 
AGCCTCCCAC GTGGCTGGGA TTACAGGTGC CTGCCCCCAT GGCTAATTTT TTGTATTTTT 2340 
TGTAGAGATG GGGTTTCACC ATGTTGGCTG GGCTGGTCTC ACTCTCCTGA CCTCAAGCAA 2400 
TCTGCCTGTC TCAGCCTCCC AAAGTGCTGG GATTACAGGC GTGAGCCACC GCCCCCAGCC 2460 
TGAGGCTTTT TTTTTTTCTA ATGCATCCAA GGTTAAGGGG AAGACGCAAA TAACAGGACT 2520 
ATTCTAA AAG GAAACCTGTT TGAACTCTGT GAG ATCAGTC ATCAGTCTCA GTATTCCACA 25B0 
GGCACACCTT AATTTCATTG TAAAAAGATA TATATATTTT GTCTATTTTT GTGCTTTTGG 2640 
GGGGCTATTT TGTGCTTTTT TACCTTATGT AGAGATCTTA TTACAAAGTG ATTTTCTACA 2700 
TTAAAAAGAG ACTGAAATAA ATTGTATAGT TACTTAACTA ATGAAGACAT TTCAGAACTC 2760 
TGGGATGATT TTAATCTTGA AGTAGTAGGT GGTATAGTCA TAAAACCATT CATCCCCTTC 2820 
TTGATTGTAT CTTAATTTTC TGGCTTTAAG GTGACATCTG AGAGGTAATG CATTCTTTTT 2880 . 
TATATTGAAA TCATAAACTA TCACCCGCTG CTTCTCTGAG TTACTTTTAA TTTTGCCTTG 2940 
TGGTTATGGT TTGGCGTTTC CTTCTGTTTG GTTTTCAGAG CCCCATGTCT ATATAGTCCT 3000 
SAGTGCAAG1 TTACT/ \ CTTGTAAATG AAGA'l n n >"l VTCI VI v 3060 
AAAAATTTTC TTGTCTTAGT TATAAAAATT CAAAGAAATCi TGTTACAAACs ATACTTAGTA 3120 
TAGCTCCTCA GCCATAACCT GAGACTTGGG ATGAAATTTA AACCAGATAC GATTTACTTT 3180 
GCAGATCATA AGGCTTTTTA TACTCTTGTT ATCAAAATGG CTTATTTTTC AGGCACTAAG 3240 
GATTGTTAAG AOAAAAGCTT TTCAACGAAG GATTGCCTTT CTTCTCCCAC ACTGTTCTTG 3300 
ATTTCCTCTC TCTTTCAGGC CTCAACAGGC ACTGTATTCA TTGCCAATGT TCCAAATTAT 3360 
CAAATTCAAG TGAATTTATT TGTGTGTTCT TTACTTATAT AAAAAAAGAT AACTTTAAGG 3420 
ATGTGCAAGT ACATTTCCAA CTGCTAGCAC AACCAGTATT TTGTAATTAA ACAAATCGCT 3480 
GTATGGTATG GTCTTCTACA CATTTATGTC TATAGATATC TATCGATCAT CTTTCTATTC 3540 
TGTTTCATGA CTGAATA ATG TAAA ACCAGT GTTGGCAATT GGTATCATCA ATGATACTCA 3 600 
TTTTTTAATA ACCAAAGGCA GGGGAAAATC ATTTTACTTA TTAATAAATA TTTTATGATG 3660 
TGAAAAAAAA AAAAAAAAAA AAAAAAAAAA 

Seq ID NO: 44 Protein sequence: 

Protein Accession ik FGENESH predicted 

1 11 21 31 41 5! 
I I I I i I 

ML.SGrI.MKPS TQHRAQYTPG GKKLPWEASI GAHTSRGRGS DRERESRPBA AGLLWDRAAA 60 
GEAEKGNRGE PPAWIRAQQQ PRPPPAGQAP GTAAGGAQDP RLRPGRSRGR VRLPVKPPEA 120 
SGRQPRGPSD CIPRFPSASA THKAVPKGTG PPAEDGDGLG APGPRAKRRR LLGVAAEGSG ISO 
PRGKRRGTVS DEARGSFGPR LLGDRPALSG DALSAPRVVP CGALAARPSP HPGTPLRSCS 240 
CCWLRCWRRG RGPSGEYCHG WLDAQGVWKI GFQCPERFDG GDATICCGSC ALRYCCSSAE 300 
ARLDQGGCDN DRQQGAGEPG RADKDGPRRL GRASCLRGTQ GDGEGAPPPV RAWQRCSPEG 360 
SPKGRQLLRA FPGLLPRARR RGFPSSPRGG PSPLQRPALP [YVPFL1VGS VFVAFIILGS 420 
LVAACCCRCL RPK.QDPQQSR APGGNR1MET IPMIPSASTS RGSSSRQSST AASSSSSANS 480 
GARAPPTRSQ TNCCLPEGTM NNVYVNMPTN FSVLNCQQAT QIVPHQGQYL HPPYVGYTVQ 540 
HDSVPMTAVP PFMDGLQPGY RQIQSPFPHT NSEQKMYPAV TV 

Seq ID NO: 45 DNA sequence 

Nucleic Acid Accessions: NM_00228S 

Coding sequence: 55-3738 (underlined sequences correspond to start and stop codons) 
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1 II 21 31 41 51 

1 I 1 I 1 1 

GCCCGAGCCT CGGCGGCGOC GGTAGCGGCG GCGGCGACGC TGACACCTCC CACCATGGAC 60 
AGCTTCG ACT TAGCCCTGCT CCAGGAATGG GACCTCGAGT CACTGTGTGT CTATGAACCA 120 
GATAGAAATG CATTACGGAG GAAAGAACGA GAAAGAAGAA ATCAAGAAAC TCAACAGGAT 180 
GATGGCACGT TTAATTCTAG TTACTCTCTC TTCAGTGAGC CCTACAAGAC TAACAAGGGG 240 
GATGAACTCT CCAACCGGAT CCAGAACACT TTAGGCAATT ATGATGAAAT GAAAGACTTT 300 
TTAACTGATA GAACCAATCA GAGTCATCTC GTTGGAGTTC CCAAACCTGG GGTTCCTCAG 360 
ACTCCTGTGA ACAAGATCGA TGAACATTTT GTTGCAGATT CAAGAGCCCA GAACCAGCCC 420 
TCGTCTATCT GTAGCACTAC AACTTCCACA CCAGCAGCTG TCCCCGTGCA GCAGAGTAAA 480 
AGAGGCACTA TGGGCTGGCA GAAGGCTGGG CACCCACCCT CTGACGGCCA ACAGAGAGCA 540 
ACACAACAGG GCTCTCTCAG GACCTTGCTT GGAGATGGTG Ti'GGCAGA CA GCAGCCTCGG 60(1 
GCCAAACAAG TGTGCAATGT GGAGGTGGGC CTTCAGACCC AGGAGAGGCC ACCTGCCATG 660 
GCGGCCA AGC ACAGCAGCAG CGGACACTGT GTTCAGAACT TTCCTCCATC CCTAG CTTCA 720 
AAACCCAGCC TGGTCCAGCA GAAACCGACC GCGTATGTGA GGCCAATGGA CGGCCAAGAT 780 
CAGGCCCCTG ATGAGTCTCC TAAGCTGAAG TCGTCTTCGG AAACCAGCGT GCACTGCACA 840 
TCATACAGGG GAGTCCCTGC CAGCAAGCCG GAGCCTGCCA GAGCCAAGGC CAAGCTCTCC 900 
AAGTTCAGCA TCCCCAAGCA GGGGGAGGAG AGTAGATCTG GAGAAACCAA CAGCTGTGTT 960 
GAAGAAATAA TCCGGGAGAT GACCTGGCTT CCACCACTTT CTGCTATTCA AGCACCTGGC 102O 
AAAGTGGAAC CAACCAAATT TCCATTTCCA AATAAGGACT CTCAGCTTGT ATCCTCTGGA 1060 
CACAATAATC CAAAGAAAGG TGATGCAGAG CCAGAGAGTC CAGACAATGG CACATCGAAT 1 140 
ACATCAATGC TGGAAGATGA CCTTAAGCTA AGCAGTGATG AAGAGGAGAA TGAACAGCAG 1200 
GCAGCTCAGA GAACGGCTCT CCGCGCTCTC TCTGACAGCG CCGTGGTCCA GCAGCCCAAC 1260 
TGCAGAACCT CGGTGCCTTC CAGCAAGGGC AGCAGCAGCA GCAGCAGCAG CGGCACGAGC 1320 
AGCTCCTCCA GCGACTCAGA GAGCAGCTCC GGATCTGACT CGGAGACCGA GAGCAGCTCC 13BO 
AGCGAGAGTG AGGGCAGCAA GCCCCCCCAC TTCTCCAGCC CCGAGGCTGA ACCGGCATCC 144D 
TCTAACAAGT GGCAGCTGGA TAAATGGCTA AACAAAGTTA ATCCCCACAA GCCTCCTATT 1500 
CTGATCCAAA.ATGAAAGCCA CGGGTCAGAfl AGCAATCAGT ACTACAACCC GGTGAAAGAG IS60 
GACGTCCAGG ACTGTGGGA UFTCOO *r( m iC AGC O ' "CCTGAG AGAGAAGGAG 1620 
ATCAAGAGCA CTTGCAAGGA GGAOCAAAGG CCAAGGACAG CCAACAAGGC CCCTGGGAGT 1680 
AAAGGCGTGA AGCAGAAGTC CCCGCCCGCG GCCGTGGCCG TGGCGGTGAG CGCAGCCGCC 1740 
CCGCCACCCG CAGTGCCCTG TGCGCCCGCG GAGAACGCGC CCGCGCCTGC CCGGAGGTCC 1800 
GCGGGCAAGA AGCCCACCAG GCGCACCGAG AGGACCTCAG CCGGGGACGG CGCCAACTGC 1860 
CACCGGCCCG AGGAGCCCGC GGCCGCGGAC GCCJCTGGGGA CGAGCGTGGT GGTCCCCCCG 1920 
GAGCCCACCA AAACCAGGCC CTGTGGCAAC AACAGAGCGA GCCACCGCAA GGAGCTGCGC 1980 
TCCTCCGTGA CCTGCGAGAA GGGCCGCACG CGGGGGCTAA GCAGGATCGT CCCCAAATCC 2040 
AAGGAGTTCA TTGAGACAGA GTCGTCATCT TCATCCTCCT CCTCGGACTC CGACCTGGAG 2100 
TCCGAGCAGG AGGAGTACCC TCTGTCCAAA GCACAGACCG TGGCTGCCTC TGCCTCCTCC 2160 
GGGAATGATC AGAGGCTGAA GGAGGCCGCT GCCAACGGGG GCAGTGGTCC TAGGGCCCCT 2220 
GTAGGCTCCA TCAACGCCAG GACCACCAGT GACATCGCCA AGGAGCTGGA GGAGCAGTTC 2280 
TACACACTGG TCCCCTTTGG CCGGAACGAA CTTCTCTCCC CTCTAAAGGA CAGTGATGAG 2340 
ATCAGGTCTC TCTGGGTCAA AATCGACCTG ACCCTCCTGT CCAGGATCCC AGAACACCTG 2400 
CCCCAGGAGC CAGGGGTATT GAGCGCCCCT GCCACCAAGG ACTCTGAGAG CGCACCGCCC 2460 
AGCCACACCT CGGACACACC TGCAGAAAAG GCTTTGCCAA AATCCAAGAG GAAACGCAAG 2520 
TCJTOACAACO AAGACGACTA CAGGGAGATC AAOAAGTCCC AGGCAGAGAA AGACAGCTCT 2550 
TCAAGACTGG CCAGCTCCAC CAGTAATACT TTGTCTGCAA ACCACTGCAA CATGAACATC 2640 
AACAGTGTGG CAATACCAAT AAATAAAAAT GAAAAAATGC TTCGGTCGCC CATCTCACCC 2700 
CTCTCTGATG CATCTAAACA CAAATACACC AGCGAGGACT TAACTTCTTC CAGCCGACCT 2760 
AATGGCAACA GTTTGTTTAC TTCAGCCTCT TCCAGCAAAA AGCCTAAGGC CGACAGCCAG 2820 
CTGCAGCCTC ACGGCGGAGA CCTCACGAAA GCAGCTCAGA ACAATTCTGA AAACATTCCC 2B80 
CTCCACAAGT CACGGCCGCA GACGAAGCCG TGGTCTCCAG GCTCCAACGG CCACAGGGAC 2940 
TGCAAGAGGC AGAAACTTGT CTTCGATGAT ATGCCTCGCA GTGCCGATTA TTTTATGCAA 3000 
. GAAGCTAAAC GAATGAAGCA TAAAGCAGAT GCAATGGTGG AAAAGTTTGG AAAGGCTTTG 3060 
AACTATGCTG AAGCAGCATT GTCGTTTATC GAGTGTGGAA ATGCAATGGA ACAAGGCCCC 3120 
ATGGAATCCA AATCTCCTTA TTACCTGATG TATTCAG AAA CAGTAGAGCT CATCAGGTAT 3 1 50 
GCTATGAGAC TAAAAACCCA CTCAGGCCCC AATGCCACAC CAGAAGACAA ACAACTGGCT 3240 
GCATTATGTT ACCGATGCCT GGCCCTCCTG TACTGGCGGA TGTTTCGACT CAAAAGGGAC 3300 
CACGCTGTAA AGTATTCAAA AGCACTAATC GACTATTTCA AGAACTCATC TAAAGCCGCC 3360 
CAAGCCCCAT CTCCGTGGGG GGCCAGTGGA AAGAGCACTG GAACCCCATC CCCCATTTCT 3420 
CCCAACCCCTTTCCCGGCAG CTCCGTGGGG TCTCAGGGCA GCCTCTCCAA CGCCAGCGCC 3480 
CTG CGTC 3ACC rCGl GCATO GCGCATCC ACCAGATGGC GGC ACCAC 354C 

GTCAGCATCA CCAACAGCAT CCTGCACAGC TACGACTACT GGGAGATGGC CGACAACCTG 3600 
GCCAAGGAAA ACCGAGA AIT C 1TCAACGAC CTGGATCTGC TCATGGGGCC GGTCACCCTG 3660 
CACAGCAGCA TGGAGCACCT GGTCCAGTAC TCCCA ACAGG GCCTGCACTG GCTGCGGAAC 3720 
AGCGCCCACC TGTCATAGGG ACCTCACCCT GGGGCCAGAG TGGGCTCTGG TCTCCACAGA 3780 
TGGCTCAACG TTTTTGGACA CTGTGCTACT GAAACTCCCA GCCACAGCAT TTATAGACTG 3S40 
CGGTGAACAT TTCCTCA 

SeqlDNO; 46 Protein sequence: 
Protein Accession #: NPJJ02276 

1 11 21 3! 41 51 

I I 1 I 1 I 

MDSFDLALLQ EWDLESLCVY EPDRNALRRK ERERRNQETQ QDDGTFNSSY SLFSEPYKTN 60 
KGDELSNR1Q NTLGNYDEMK DFLTDRTNQS HLVGVPKPGV PQTPVNKIDE HFVADSRAQN 120 
QPSSICSTTT STPAAVPVQQ SKRGTMG WQK AGHPPSDGQQ RATQQGSLRT LLGDG VGRQQ 180 
PRAKQVCNVE VGLQTQERPP AM A AKHSSSG KCVQNFPPSL ASKPSLVQQK PTAY VRPMDG 240 
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QDQAPDESPK LKSSSETSVH CTSYRGVPAS KPEPARAKAK LSKFSIFKQG EESRSGETNS 300 
CVEEIIREMT WLPFLSAIQA PGKVEPTKFP FPNKDSQLVS SGHNNPKKGD AEPESPDNGT 360 
SNTSMLEDDL KLSSDEEENE QQA AQRTALR ALSDSA WQQ PNCRTSVPSS KGSSSSSSSG 420 
TSSSSSDSES SSOSDSETES SSSESEGSKP PHFSSPEAEP ASSNKWQLDK WLNKVNPHKP 480 
5 PILIQNESHG SESNQYYNPV KEDVQDCGKV PDVCQPSLRE KEKSTCKEE QRPRTANKAP 540 

GSKGVKQKSP PA AVAVAVSA A APPPAVPCA PAENAPAPAR RSAGKKPTRR TERTSAGDGA 600 
NCHRPEEPAA ADALGTSVVV PPEPTKTRPC GNNRASHRKE LRSSVTCEKR RTRGLSRJVP 660 
KSKEFIETES SSSSSSSDSD LESEQEEYPL SKAQTVAASA SSGNDQRLKE AAANGGSOPR 720 
APVGSINART TSDIAKELEE QFYTLVPFGR NELLSPLKDS DEIRSLWVKI DLTLLSRIPE 780 

10 HLPQEPGVLS APATKDSESA PPSHTSDTPA EKALPKSKRK RKCDNEDDYR EIKKSQGEKD 840 
SSSRLATSTS NTLSANHCNM NINSVAIP1N KNEKMLRSPI SPLSDASKI iK YTSEDU'SSS 900 
RF GNSLFTS ASSSKKPK I SQLQPHGfiDL 1 ' 1 „L , i, , ' i'yTM ' r Ujt! 
RDCKRQKLVF DDMPRSADYF MQEAKRMKHK ADAMVEKFGK ALNYAEAALS FIECGNAMEQ 1020 
GPMESKSPYY LMYSETVELI RYAMRLKTHS GPNATPEDKQ LAALCYRCLA LLYWRMFRLK 1080 

1 5 RDHAVKYSKA LIDYFKNSSK AAQAPSPWGA SGKSTGTPSP ISPNPFPGSS VGSQGSLSNA 1 140 
SALSPSTIVS IPQEIHQMAA NHVS1TNSIL HSYDYWEMAD NLAKENREFF NDLDLLMGPV 1200 
TLHSSMEHLV QYSQQGLHWL RNSAHLS 

Sen ID NO; 47 DNA sequence 
20 Nucleic Acid Accession ± NM 033151 

Coding sequence: 351-4499{underiineil sequences correspond to start and stop codons) 



1 11 21 31 41 51 

25 | | | | I I 

ACTGGG ATAA AGCAAG AAGA CTGATTTTAT G AGCAGGGGT TTG ATACATC AAAGG AG ATT 60 
CCC( 0( \TC . ' '<<iGTGCGG TGTTGGGGGTGGGTTGGGGA GGGTGGTTAG AGAAGGTTTC 120 
ACTAAGTCSAT T1GGGCCTGA GGCCTGAGAA OATGTTTAAA AAGAGGGATC AAGCACAGGC 1 80 
TAAGGAGAGG AAAGAGCAGG CACCCAAACC TCTGCATGGC CCCAATATGC TCCCTGCAGG 240 

3 0 GTAGTGCCCC CTCTTCTGGC TGCTCAAGGC G AGATCTA AG CTTCTTCTAA CTCCTGCTGT 300 
CTTTTCATATTCTCTGATTC TGGGAAACGA AGAATTGGCA GGAACTGAAA ATGACTAGGA 360 
AGAGGACATA CTGGGTGCCC AACTCTTCTG GTGGCCTCGT GAATCGTGGC ATCGACATAG 420 
GCGATGACAT GGTTTCAGGA CTTATTTATA AAACCTATAC TCTCCAAGAT GGCCCCTGGA 480 
GTCAGCAAGA GAGAAATCCT GAGGCTCCAG GGAGGGCAGC TGTCCCACCG TGGGGGAAGT 540 

3 5 ATGATGCTGC CTTGAGAACC ATGATTCCCT 7CCGTCCCA A GCCG AGGTTT CCTGCCCCCC 600 
AGCCCCTGG A CAATGCTGG C CTGTTCTCCT ACCTCACCGT GTCATGGCTC ACCCCGCTCA 660 
TGATCCAAAG CTTACGGAGT CGCTTAGATG AGAACACCAT CCCTCCACTG TCAGTCCATG 720 
ATGCCTCAGA CAAAAATGTC CAAAGGCTTC ACCGCCTTTG GGAAGAAGAA GTCTCAAGGC 780 
GAGGGATTGA AAAAGCTTCA GTGCTTCTGG TGATGCTGAG GTTCCAGAGA ACAAGGTTGA 340 

40 TTTTCGATGC ACTTCTGGGC ATCTGCTTCT GCATTGCCAG TGTACTCGGG CCAATATTGA 900 
TTATACCAAA GATCCTGGAA TATTCAGAAG AGCAGTTGGG GAATGTTGTC CATGGAGTGG 960 
GACTCTGCTT TGCCCTTTTT'CTCTCCGAAT GTGTGAAGTC TCTGAGTTTC TCCTCCAGTT 1020 
GG ATCATCAA CCAACOCACA GCCATCAGGT TCCGAGCAGC TGTTTCCTCC TTTGCCTTTG 1 080 
AGA At iCTCAT CCA AT1TAAG TCTGTAATAC ACATCACCTC AGGAGAGGCC ATCAGCTTCT 1 140 

4;s , i , 1 l-AAACl i ;TGTTTGAAG GGGTGTGC'l rGGACCCI A GTACTG rCA 1200 

CCTCCOCATC GCTOGTCATC TGCAGCATTT CTTCCTAGTT CATTATTGGA TACACTOCAT 1260 
TTATTGCCAT CTTATGCTAT CTCCTGGTTT TCCCACTGGC GGTATTCATG ACAAG AATGG 1 320 
CTGTGAAGGC TCAGCATCAC ACATCTGAGG TCAGCGACCA GCGCATCCGT GTGACCAGTG 1380 
AAGTTCTCAC TTGCATTAAG CTGATTAAAA TGTACACATG GGAGAAACCA TTTGCAAAAA 1440 

5 0 TCATTG AAGA CCTA AGAAGG AAGGAAAGG A AACTATTGGA GAAGTGCGGG CTTGTCCAG A 1 500 
GCCTGACA AG TATAACCTTG TTCATCATCC CCACAGTGGC CACAGCGGTC TGGGTTCTCA 1 560 
TCCACACATC CTTAAAGCTG AAACTCACAG CGTCAATGGC CTTCAGCATG CTGGCCTCCT 1620 
TGAATCTCCT TCGGCTGTCA GTGTTCTTTG TGCCTATTGC AGTCAAAGGT CTCACG AATT 1 6SO 
CCAAGTCTGC AGTGATGAGG TTCAAGAAGT TTTTCCTCCA GGAGAGCCCT GTTTTCTATG 1740 

55 TCCAGACATT ACAAGACCCC AGCAAAGCTC TGGTCTTTGA GGAGGCCACC TTGTCATGGC 1800 
AACAGACCTG TCCCGGGATC GTCAATGGGG CACTGGAGCT GGAGAGGAAC GGGCATGCTT 1860 
CTGAGGGGAT GACCAGGCCT AGAGATGCCC TCGGGCCAGA GGAAGAAGGG AACAGCCTGG 1920 
GCCCAGAGTT GCACAAGATC AACCTGGTGG TGTCCAAGGG GATGATGTTA GGGGTCTGCG 1980 
GCAACACGGG G AGTGGTAAG AGCAGCCTGT TGTCAGCCAT CCTGGAGGAG ATGCACTTGC 2040 

60 TCGAGGGCTC GGTGGGGGTG CAGGGAAGCC TGGCCTATGT CCCCCAGCAG GCCTGGATCG 2100 
TCAGCGGGA A CATCAGGGAG AACATCCTCA TGC5GAGGCGC ATATGACAAG GCCCGATACC 2160 
TCCAGGTGCT CCACTGCTGC TCCCTGAATC GGGACCTGGA ACTTCTGCCC TTIXSGAGACA 2220 
TGACAGAGAT TGGAGAGCGG GGCCTCAACC TCTCTGGGGG GCAGAAACAG AGGATCAGCC 2280 
TGGCCCGCGC CGTCTATTCC GACCGTCAGA TCTACCTGCT GGACGACCCC CTGTCTGCTG 2340 

65 TGGACGCCCA CGTGGGGAAG CACATTTTTG AGGAGTGCAT TAAGAAGACA CTCAGGGGGA 2400 
AGACGOTCGT CCTGGTGACC CACCAGCTGC AGTACTTAGA ATTTTGTGGC CAGATCATTT 2460 
TGTTGGAAAA TGGGAAAATC TGTGAAAATG GAACTCACAG TGAGTTAATG CAGAAAAAGG 2520 
GGAAATATGC CCAACTTATC CAGAAGATGC ACAAGGAAGC CACTTCGGAC ATGTTGCAGG 2580 
ACACAGCAAA GATAGCAGAG AAGCCAAAGG TAGAAAGTCA GGCTCTGGCC ACCTCCCTGG 2640 

70 AAGAGTCTCT CAACGGA A AT GCTGTGCCGG AG CATCAGCT CACACAGGAG GAGGAGATGG 2700 
AAGAAGGCTC CTTGAGTTGG AGGGTCTACC ACCACTACAT CCAGGCAGCT GGAGGTTACA 2760 
TGGTCTCTTG CATAATTTTC TTCTTCGTGG TGCTGATCGT CTTCTTAACG ATCTTCAGCT 2820 
TCTGGTGQCT GAGCTACTGG TTGGAGCAGG GCTCGGGGAC CAATAGCAGC CGAGAGAGCA 2860 
ATGGAACCAT GGCAGACCTG GGCAACATTG CAGACAATCC TCAACTGTCC TTCTACCAGC 2940 

75 TGGTGTACGG GCTCAACGCC CTGCTCCTCA TCTGTGTGGG GGTCTGCTCC TCAGGGATTT 3000 
TCACCAAAGT CACGAGGAAG GCATCCACGG CCCTGCACAA CAAGCTCTTC AACAAGGTTT 3060 
TCCGCTGCCC CATGAGTTTC TTTGACACCA TCCCAATAGG CCGGCTTTTG AACTGCTTCG 3120 
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CAGGGGACTT GGAACAGCTG GACCAGCTCT TGCCCATCTT TTCAGAGCAG TTCCTGGTCC 3180 
TGTCCTTAAT GGTGATCGCC GTCCTGTTGA TTGTCAGTGT GCTGTCTCCA TATATCCTGT 3240 
TAATGGGAGC CATAATCATG GTTATTTGCT TCATTTATTA TATGATGTTC AAGAAGGCCA 3300 
TCGGTGTGTT CAAGAGACTG GAGAACTATA GCCGGTCTCC TTTATTCTCC CACATCCTCA 3360 
■ 5 ATTCTCTGCA AGGCCTGAGC TCCATCCATG TCTATGGAAA AACTGAAGAC TTCATCAGCC 3420 
AGTTTAAGAG GCTGACTGAT GCGCAGAATA ACTACCTGCT GTTGTTTCTA TCTTCCACAC 34SO 
GATGGATGGC ATTGAGGCTG GAGATCATGA CCAACCTTGT GACCTTGGCT GTTGCCCTGT 3540 
TCGTGGCTTT TGGCATTrCC TCCACCCCCT ACTCCTTTAA AGTCATGGCT GTCAACATCG 3600 
TGCTGCAGCT GGCGTCCAGC TTCCAGGCCA CTGCCCGGAT TGGCTTGGAG ACAGAGGCAC 3660 

1 0 AGTTCACGGC TGTAGAGAGG ATACTGCAGT ACATGAAGAT GTGTGTCTCG GAAGCTCCTT 3720 
TACACATCGA AGGCACAAGT TGTCCCCAGG GGTGGCCACA GCATGGGGAA ATCATATTTC 3780 
AGGATTATCA CATGAAATAC AGAGACAACA CACCCACCGT uCTTCACGGC ATCAACCTGA 3840 
CCATCCGCGG CCACGAAGTG GTGGGCATCG TGGGAAGGAC GGGCTCTGGG AAGTCCTCCT 3900 
TGGGCATGGC TCTCTTCCGC CTGGTGGAGC CCATGGCAGG CCGGATTCTC ATTGACGGCG 3960 

1 5 TGGACATTTG CAGCATCGGC CTGGAGGAGT TGCGGTCCAA GCTCTCAGTG ATCCCTCAAG 4020 
AtCCAGTGCT GCTCTCACGA ACCATCAGAT TCAACCTAGA TCCCTTTGAC CGTCACACTG 408O 
ACCAGCAGAT CTGGGATGCC TTGGAGAGGA CATTCCTGAC CAAGGCCATC TCAAAGTTCC 4140 
CCAAAAAGCT GCATACAGAT GTGGTGGAAA ACGGTGGAAA CTTCTCTGTG GGGGAGAGGC 420D 
AGCTGCTCTG CATTGCCAGG GCTGTGCTTC GCAACTCCAA GATCATCCTT ATCGATGAAG 4260 

20 CCACAGCCTC CATTGACATG GAGACAGACA CCCTGATCCA GCGCACAATC CGTGAAGCCT 4320 
TCCAGGGCTG CACCGTGCTC GTCATTGCCC ACCGTGTCAC CACTGTGCTG AACTGTGACC 4380 
ACATCCTGGT TATGGGCAAT GGOAAGQTGG TAGAATTTGA TCGGCCGG AG GTACTGCGGA 4440 
AGAAGCCTGG GTCATTGTTC GCAGCCCTCA TGGCCACAGC CACTTCTTCA CTGAGATAAG 4500 

„ ■ GAGATGTGGA GACTTCATGG AGGCTGGCAG CTGAGCTCAG AGGTTCACAC AGGTGCAGCT 4560 

25 TCGAGGCCCA CAGTCTGCGA CCTTCTTGTT TGGAGATGAG AACTTCTCCT GGAAGCAGGG 4620 
GTAAATCTAG GGGGGGTGGG GATTGCTGGA TGGAAACCCT GGAATAGGCT ACTTGATGGC 4680 
TCTCAAGACC TTAGAACCCC AGAACCATCT AAGACATGGG ATTCAGTGAT CATGTGGTTC 4740 

tccttttaac itacatgc yg aataatttta taataaggta aaagcttata gttttctgat 4800 
ctgtgttaga agtgttgcaa atgctgtact gactttgtaa aatataaaac taaggaaaac 4860 
30 tc 

Scq ID NO; 48 Praidnjegucncei 
Protein Accession*: NPJ49163.2 

3 5 MTRKRTYWVPNSSGGLV1WGIDIGDDMVSGLIYKTYTLQDGPWSQQERNPEAPGRAAVPPWGKYDAALRT 
MffFEPKPRFPAPQPl-DNAaLFSVLTVSWLTFLMIQSLRSRLDENTlPPLSVHDASDKNVQRLEiRLWEEE 
VSRRGIEKASVLLVMLRFQRTRLIFDALLGICFOASVLGPILIIP1CILEYSEEQLGNVVHGVGLCFALF 
LSECVKSLSFSSSWIINQRTAIRFRAAVSSFAFEKLIQFKSVIHITSGEAISFFTGDVNYLFEGVCYGPL 
VL1TCASLVICSISSYFIIGYTAFIAILCYLLVFPLAVFMTRMAVKAQHHTSEVSDQRIRVTSEVLTCIK 

40 LIKMYTWEKPFAKIIEDLRRKEEtKLLEKCGLVQSLTSlTLFnPTVATAVWVLlHTSLKLKLTASMAFSM 
LASLNLLRLSVFFVPIAVKGLTNSKSAVMRFKKFFLQESPVFYVQTLQDFSKALVFEEATLSWQQTCPGI 
VNGALELERNGHASEGMTRPRDALGPEEEGNSLGPELHKINLWSKGMMLGVCGNTGSGKSSLLSAILEE 
ME1LLEGSVGVQGSLAYVPQQAWIVSGNIREKILMGGAYDKARYLQVLHCCSLNRDLELLPFGDMTEIGER 
GLl\XSGGrjKQPJS[.ARAVYKDRQIYLLDDPLSAVDAIlVGKHi:-'hHciKKTLRGKTVVLVTHOLQYLEFCG 

45 Q]ILLEKGKICENGTilSliLMQKKGXYAQ]-lQK\lHKEATSDi%lLQDTAKIAEKl'KVESQALATSLEESLNGN 
AVPEHkLT^r 1 GSI.SW IUI K C?GYMVS0IFW UVF 1J3FWWLSYWLHQGSGTNSS 
RESNGTMADLGNIADNPQLSFYQLVYGLNALLLICVGVCSSGIFTKVTRKASTALHNKLFNKVFRCPMSF 
FDTIPlGRLLNCFAGDLEQLDQLLPIFSEQFLVLSLMVLAVLLlVSVLSPYiLLMGAIIMVICFrYYMMF 
KKA1GVFKRLENYSRSPLFSHILNSLQGLSSIHV YGKTEDHSQFKRLTDAQNNYU.LFLSSTRWMALRL 

50 EaiTMLVTLAVALFVAFGISSTPYSFKVMAVNIVLQLASSFQATARIGLETEAQFTAVERILQYMK>ICVS 
EAPLHMEGTSCPQGWPQHGEIIFQDYHMKYRDNTPTVLHGINLTIRGIffiWGIVGRTasGKSSl-GMALFR 
LVEPMAGRILIDGVDICSIGLEDLRSKLSVIPQDPVLLSGT1RFNLDPFDRHTDQQIWDALERTFLTKAI 
SKFPKKLHTDVVENGGNFSVGERQLLCIARAVLRNSKIILlDEATASIDMETDTLIQRTffiEAFQGCTVL 

^ VIAHRVrrVLNCDHlLVMGNGKVVEFDRPEVLRKKPGSLFAALMATATSSLR 

Seq !D NO: 49 DNA sequence 

Nucleic Acid Accession #: NM 03341 9 

Coding sequence: 18-980 (underfilled sequences correspond to sfart and stop cadons) 

60 1 11 21 31 41 51 
I I I I I I 

CGAGCCAGGG AOAAAGGATG GCCGGCCTGG CGGCGCGGTT GGTCCTGCTA GCTGGGGCAO 60 
CGGCGCTGGC GAGCGGCTCC CAGGGCGACC GTGAGCCGGT GTACCGCGAC TGCGTACTGC 120 
AGTGCGAAGA GCAGAACTGC TCTGGGGGCG CTCTGAATCA CTTCCGCTCC CGCCAGCCAA 180 

65 TCTACATGAG TCTAGCAGGC TGGACCTGTC GGGACG ACTG TAAGTATGAG TGTATGTGGG 240 
TCACCGTTGG GCTCTACCTC CAGGAAGGTC ACAAAGTGCC TCAGTTCCAT GGCAAGTGGC 300 
CCTTCTCCCG GTTCCTGTTC TTTCAAGAGC CGGCATCGGC CGTGGCCTCG TTTCTCAATG 360 
GCCTGGCCAG CCTGGTGATG CTCTGCCGCT ACCGCACCTT CGTGCCAGCC TCCTCCCCCA 420 
TGTACCACAC CTGTGTGGCC TTCGCCTGGG TCTCCCTCAA TGCATGGTTC TGGTCCACAG 4S0 

70 TTTTCC ACAC CAGGGACACT GACCTCACAG AGAA AATGGA CTACTTCTGT GCCTCCACTG 540 
TCATCCTACA CTCAATCTAC CTGTGCTGCG TCAGGACCGT GGGGCTGCAG CACCCAGCTG 600 
TGGTCAGTGC CTTCCGGGCT CTCCTGCTGC TCATGCTGAC CGTGCACGTC TCCTACCTGA 660 
GCCTCATCCG CTTCGACTAT GGCTACAACC TGGTGGCCAA CGTGGCTATT GGCCTGGTCA 720 
ACGTGGTGTG GTGGCTGGCC TGGTGCCTGT GGAACCAGCG GCGGCTGCCT CACGTGCGCA 780 

75 AGTGCGTGGT GGTGGTCTTG CTGCTGCAGG GGCTGTCCCT GCTCGAGCTG CTTGACTTCC 840 
CACCGCTCTT CTGGGTCCTG GATGCCCATG CCATCTGGCA CATCAGCACC ATCCCTGTCC 900 
ACGTCCTCTT TTTCAGCTTT CTGGAAGATG ACAGCCTGTA CCTGCTGAAG GAATCAGAGG 960 
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ACAAGTTCAA GCTGGACTGA AGACCTTOGA GCGAGTCTGC CCCAGTGGGG ATCCTGCCCC 1020 
CGCCCTGCTC GCCTCCCTTC TCCCCTCAAC CCTTGAGATG ATTTTCTCTT TTCAACTTCT 1 080 
TGAACTTGGA CATGAAGGAT GTGGGCCCAG AATCATGTGG CCAGCCCACC CCCTGTTGGC 1140 
, CCTCACCAGC CTTGGAGTCT GTTCTAGGGA AGGCCTCCCA GCATCTGGGA CTCGAG AGTG 1 200 
5 GGCAGCCCCT CTACCTCCTG GAGCTGAACT GGGGTGGAAC TGAGTGTGCT CTTAGCTCTA 1260 
CCGGGAGGAC AGCTGCCTGT TTCCTCCCCA TCAGCCTCCT CCCCACATCC CCAGCTGCCT 1320 
GGCTGGGTCC TGAAGCCCTC TGTCTACCTG GGAGACCAGG GACCACAGGC CTTAGGGATA 1380 
CAGGGGGTCC CCTTCTGTTA CCACCCCCCA CCCTCCTCCA GGACACCAGT AGGTGGTGCT 1440 
GGATGCTTGT TCTTTGGCCA GCCAAGGTTC ACGGCGATTC TCCCCATGGG ATCTTGAGGG 1500 
1 0 ACCAAGCTGC TGGGATTGGG AAGGAGTTTC ACCCTGACCA TTGCCCTAGC CAGGTTCCCA 1560 
GGAGGCCTCA CCATA.CTCCC TTTCAGGGCC AGCKICTCCAG CAAGCCCAGG GCAA.GGATCC 1620 
TCTGCTGCTG TCTGGTTGAU AGCCTGCCAC CGTGTGTCGG GAGTGTGGGC CAGGCTGAGT 1680 
GCATAGGTGA CAGGGCCGTG AGCATGGGCC TGGGTGTGTG TGAOCTCAGG CCTAGGTGCG 1740 
CAGTGTGGAG ACGGGTGTTG TCGGGGAAGA GGTGTGGCTT CAAAGTGTGT GTGTGCAGGG 1 800 
1 5 GGTGGGTGTG TTAGCGTGGG TTAGGGG AAC GTGTGTGCGC GTGCTGGTGG GCATGTGAGA 1 860 
TGAGTGACTG CCGGTGAATG TGTCCACAGT TG AG AGGTTG GAGCAGGATG AGGG AATCCT 1 920 
GTCACCATCA ATAATCACTT GTGGAGCGCC AGCTCTGCCC AAGGCGCC AC CTGGGCGGAC 1980 
AGCCAGGAGC TCTCCATGGC CAGGCTGCCT GTGTGCATGT TCCCTGTCTG GTGCCCCnT 2040 
GCCCGCCTCC TGCAAACCTC ACAGGGTCCC CACACAACAG TGCCCTCCAG AAGCAGCCCC 2100 
20 TCGGAGGCAG AGGAAGGAAA ATGGGGATGG CTGGGGCTCT CTCCATCCTC CTTTTCTCCT 2160 
TGCCTTCGCA TGGCTGGCCT TCCCCTCCAA AACCTCCATT CCCCTGCTGC CAGCCCCTTT 2220 
GCCATAGCCT GATTTTGGGG AGGAGGAAGG GGCGATTTGA GGGAGAAGGG GAGAAAGCTT 22S0 
ATGGCTGGGT CTGGTTTCTT CCCTTCCCAG AGGGTCTTAC TGTTCCAGGG TGGCCCCAGG 2340 
GCAGGCAGGG GCCACACTAT GCCTGCGCCC TGGTAAAGGT GACCCCTGCC ATTTACCAGC 24C0 
25 AGCCCTGGCA TGTTCCTGCC CCACAGGAAT AGAATGGAGG GAGCTCCAGA AACTTTCCAT 2460 
CCCAAAGGCA GTCTCCGTGG TTGAAGCAGA CTGGATTTTT GCTCTGCCCC TGACCCCTTG 2520 
TCCCTCTTTG AGGGAGGGGA GCTATGCTAG GACTCCAACC TCAGGGACTC GGGTGGCCTG 2580 
CGCTAGCTTC TT1TGATACT GA AAACTTTT AAGGTGGG AG GGTGGCAAGG G 
ATAAATCAAT TCCAAGCCTC AAAAAAAAAA AAAAAAAAAA AA 



30 
35 
40 
45 



60 



Scq ID NQ: SO Protein sequence: 



i QPTFCWPSPA LESVLGKASQ HLGLESGQPL YLLELNWGGT ECALSSTGRT 60 
AACFLPISLL PTSPAAWI.GP EALCLPGRPG TTGLRDTGGF LLLPPPTLLQ DTTRWCWMLV 120 
LWPAKVHGDS PHGILRDQAA GIGKEFHPDH CPSQVPRRPH HTPFQGQGSS KPRARILCCC 1 80 
LVESLPPCVG SVGQAECIGD RAVSMGLGVC ELRPRCAVWR RVLSGKRCGF KVCVCRGWVC 

Seq ID NO: 51 DNA sequence 

Nucleic Acid Accession #■ XM 059098.1 

Coding sequence: 1 78-61 8(uiidcrlined sequences correspond to sum and stop codons) 



1 11 21 31 41 51 
I I I I I I 
GATGTACACT CTGAAGTGAG CACATTCCTG TTGGCAGGAC ATGACACCTT GGCAGCAAGC 60 
n ATCTCCTGGA TCCTTTACTG CCTGGCTCTG AACCCTGAGC ATCAAGAGAG ATGCCGGGAG 120 
5 0 GAGGTCAGGG GCATCCTGGG GGATGGGTCT TCTATCACTT GGGACCAGCT GGGTGAG ATG 1 80 
TCGTACACCA CAATGTGCAT CAAGGAGACG TGCCGATTGA TTCCTGCAGT CCCGTCCATT 240 
TCCAGAGATC TCAGCAAGCC ACTTACCTTC CCAGATGGAT GCACATTGCC TGCAGGGATC 300 
ACCGTGQTTC TTAGTATTTG GGGTCTTCAC CACAACGCTG CTGTCTGGAA AAACCCAAAG 360 
_„ GTCTTTGACC CCTTGAGGTT CTCTCAGGAG AATTCTGATC AGAGACACCC CTATGCCTAC 420 
5 5 TTACCATTCT CAGCTGG ATC AAGGAACTGC ATTGGGCAGG AGTTTGCCAT GATTG AGTTA 480 
AAGGTA ACCA TTGCCTTGAT TCTGCTCCAC TTCAGAGTGA CTCCAGACCC CACCAGGCCT 540 
CTTACTTTCC CCAACCATTT TATCCTCAAG CCCAAGAATG GGATGTATTT GCACCTGAAG 600 
AAACTCTCTG AATGTTAGAT CTCAGGGTAC AATGATTAAA CGTACTTTGT TTTTCGAAGT 660 
TAAATTTACA GCTAATGATC CAAGCAGATA GAAAGGGATC AATGTATGGT GGGAGGATTG 720 
GAGGTTGGTG GGATAGGGGT CTCTGTGAAG AGATCCAAAA TCATTTCTAG GTACACAGTG 780 
TGTCAGCTAG ATCTGTTTCT ATATAACTTT GGGAGATTTT CAGATCTTTT CTGTTAAACT 840 
TTCACTACTA TTAATGCTGT ATACACCAAT AGAC1TTCAT ATATTTTCTG TTGTTTTTAA !J0D 
AATAGTTTTC AGAATTATGC AAG'I AATAAG TGCATGTATG CTCACTGTCA AAAATTCCCA 960 
ACACTAGAAA ATCATGTAGA ATAAAAATTT TAAATCTCAC TTCACTTAGC CGACATTCCA 1020 
0 5 TGCCCTGACC A ATCCTACTG CTTTTCCTA A AAACAGA ATA ATTTGGTGTG CATTCTTTCA 1 080 
GACTTTTTCC TATACATTTT ATATGTAGAA ATGTAGCAAT GTATTTGTAT AGATGTGATC 1 140 
ATTCCTATAT TGTTATTGAT TTTTTTCACT TAATAAAAAT TCACCTTATT CCTT 

Seq ID NO: 52 Protein sequence: 



MSYTTMCIKE TCRLfPAVPS 1SRDLSKPLT FPDGCTLPAG ITVVLS1WGL HHNPAVWKNP 60 
KVFDPLRFSQ ENSDQRHPYA YLPFSAGSRN C1GQEFAMIE LKVTIALILL HFRVTPDPTR 120 
PLTFPNHFIL KPKNGMYLHL KKLSEC 
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Seq ID NO: 53 DNA sequence 

Nucleic Acid Accession U: NM_0309 1 6 

Coding sequence: 1-1S33 (underlined sequences correspond to start and stop codons) 

5 1 II 21 31 41 51 
I I ] I I 1 

ATGCCCCTGT CCCTGGGAGC CGAGATGTGG GGGCCTGAGG CCTGGCTGCT GCTGCTGCTA 60 
CTGCTOGCAT CATTTACAGG CCGGTGCCCC GCGGGTGAGC TGGAGACCTC AGACGTGGTA 120 
ACTGTGGTGC TGGGCCAGGA CGCAAAACTG CCCTGCTTCT ACCGAGGGGA CTCCGGCGAG 1 80 
1 0 CAAGTGGGGC AAGTGGCATG GGCTCGGGTG GACGCGGGCG AAGGCGCCCA GGAACTAGCG 240 
CTACTGCACT CCAAATACGG GCTTCATGTG AGCCCGGCTT ACGAGGGCCG CGTGGAGCAG 300 
CCGCCGCCCC CACGCAACCC CCI'GGACGGC TCAGTGCTCC TGCGCAACGC AGTGCAGGCG 3S0 
GATGAGGGCG AGTACGAGTG CCGGGTCAGC ACCTTCCCCG CCGGCAGCTT CCAGGCGCGG 420 
CTGCGGCTCC GAGTGCTGGT GCCTCCCCTG CCCTCACTGA ATCCTGGTCC AGCACTAGAA 480 
1 5 GAGGGCCAGG GCCTGACCCT GGCAGCCTCC TGCACAGCTG AGGGCAGCCC AGCCCCCAGC 540 
GTGACCTGGG ACACGGAGGT CAAAGGCACA ACGTCCAGCC GTTCCTTCAA GCACTCCCGC 600 
TCTGCTGCCG TCACCTCAGA GTTCCACTTG GTGCCTAGCC GCAGCATGAA TGGGCAGCCA 660 
CTGACTTGTG TGGTGTCCCA TCCTGGCCTG CTCCAOGACC AAAGGATCAC CCACATCCTC 720 
_ . CACGTGTCCT TCCTTGCTGA GGCCTCTGTG AGGGGCCTTG AAGACCAAAA TCTGTGGCAC 780 
20 ATTGGCAGAG AAGGAGCTAT GCTCAAGTGC CTGAGTGAAG GGCAGCCCCC TCCCTCATAC 340 
AACTGGACAC GGCTGGATGG GCCTCTGCCC AGTGGGGTAC GAGTGGATGG GGACACTTTG 900 
GGCTTTCCCC CACTGACCAC TGAGCACAGC GGCATCTACG TCTGCCATGT CAGCAATGAG 960 
TTCTCCTCAA GGGATTCTCA GGTGACTGTG GATGTTCTTG ACCCCCAGGA AGACTCTGGG 1020 
_ _ AAGCAGGTGG ACCTAGTGTC AGCCTCGGTG GTGGTGGTGG GTGTGATCGC CGCACTCTTG 1080 
25 TTCTGGGTTC TGGTGGTGGT GGTGGTGCTG ATGTCCCGAT ACCATCGGCG CAAGGCCCAG 1 140 

CAGATGACCC AGAAATATGA GGAGGAGCTG ACCCTGACCA GGGAGAACTC CATCCGGAGG 1200 
CTGCATTCCC ATCACACGGA CCCCAGGAGC CAGCCGGAGG AGAGTGTAGG GCTGAGAGCC 1260 
G AGG GCCACC CTGATAGTCT CAAGGACAAC AGTAGCTGCT CTGTGATGAG TG A AG AGCCC 1320 
„ GAGGGCCGCA GTTACTCCAC GCTGACCACG GTGAGGGAGA TAGAAACACA GACTGA ACTG 1380 

3 0 CTGTCTCCAG GCTCTGGGCG GGCCGAGGAG GAGGAAGATC AGGATGAAGG CATCAAACAG 1440 
GCCATGAACC ATTTTGTTCA GGAGAATGGG ACCCTACGGG CCAAGCCCAC GGGCAATGGC 1500 
ATCTACATCA ATGGGCGGGG ACACCTGGTC TGA 

Seq ID NO: 54 Prptein sequence: 
35 Protein Accession]?: NP_112178.1 

1 11 21 31 41 51 
I 1 I I I I 

MPLSLGAEMW GPEAWLLLLL LLASFTGRCP AGELETSDVV TVVLGQDAKL PCFYRGDSGE 60 
40 QVGQVAWARV DAGEGAQELA LLHSfCYGLHV SPAYEGRVEQ PPPPRNPLDG SVLLRNAVQA !20 
DEGEYECRVS TFPAGSFQAR LRLRVLVPPL PSLNPGPALE EGQGLTLAAS CTAEGSPAPS 1 80 
VTWDTEVKGT TSSRSFKHSR SAAVTSEFHL VPSRSMNGQF LTCVV5HFGL LQDQR1THIL 240 
HVSPLAEASV RGLEDQNLWH IGREGAMLKC LSEGQPPPSY NWTRLDGPLP SGVRVDGDTL 300 
GFPPLTTEHS GIYVCHVSNE FSSRDSQVTV DVLDPQBDSG KQVDLVSASV VVVG VIAAIX 360 
45 FCLLVVVVVL MSRYHRRKAQ QMTQKYEEEL TLTRENSDtR LHSHHTDPRS QPEESVGLRA 420 
EGHPDSLKDN SSCSVMSEEP EGRSYSTLTT VREIETQTEL LSPGSGRAEE EEDQDEGIKQ 480 
AMNHFVQENG TLRAKFTGNG IYINGRGHLV 

Seq ID NO: 55 DNA sequence 
50 Nucleic Acid Accession ft AF00717O.I 

Coding sequence: 73-1725 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
,,111111 

55 AAGGAGGCGG CCTCCGGGAA AAGCGACCGC AGGACTCCTG AGAGCAGCCT CCATGAGGCC 60 
CTGGACCAGT GCATGACCGC CCTGGACCTC TTCCTCACCA ACCAGTTCTC AGAAGCACTC 120 
AGCTACCTCA AGCCCAGAAC CAAGGAAAGC ATGTACCACT CACTGACATA TGCCACCATC 180 
CTGGAGATGC AGGCCATGAT GACCTTTGAC CCTCAGGACA TCCTGCTTGC CGGCAACATG 240 
. . ATGAAGG AGG CACAG ATOCT GTGTCAGAGG CACCGG AGGA AGTCTTCTGT AACAGATTCC 300 

60 TTCAGCAGCC TGGTGAACCG CCCCACGCTG GGCCAATTCA CTGAAGAAGA AATCCACGCT 360 
GAGGTCTGCT ATGCAGAGTG CCTGCTGCAG CGAGCAGCCC TGACCTTCCT GCAGGACGAG 420 
AACATGGTGA GCTTCATCAA AGGCGGCATC AAAGTTCGAA ACAGCTACCA GACCTACAACi 480 
GAGCTGGACA GCCTTG1TCA GTCCTCACAA TACTGCAAGG GTOAGAACCA CCCGCACTTT S40 
GAAGGAGGAG TGAAGCTTGG TGTAGGGGCC TTCAACCTGA CACTGTCCAT GCTTCCTACT 600 

65 AGGATCCTGA GGCTGTTGGA GTTTGTGGGG TTTTCAGGAA ACAAGGACTA TGGGCTGCTG 660 
CAGCTGGAGG AGGG AGCGTC AGGGCACAGC TTCCGCTCTG TGCTCTGTGT CATGCTCCTG 720 
CTGTGCTACC ACACCTTCCT CACCTTCGTG CTCGGTACTG GGAACGTCAA CATCGAGGAG 780 
GCCGAGAAGC TCTTOAAGCC CTACCTGAAC CGGTACCCTA AGGGTGCCAT CTTCCTGTTC 840 
TTTGCAGGGA GGATTGAAGT CATTAAAGGC AAC ATTGATG CAGCCATCCG GCGTTTCG AG 900 

70 GAGTGCTGTG AGGCCCAGCA GCACTGGAAG CAGTTTCACC ACATGTGCTA CTGGGAGCTG 960 
ATGTGGTGCT TCACCTACAA GGGCCAGTGG AAGATGTCCT ACTTCTACGC CGACCTGCTC S020 
AGCAAGGAGA ACTGCTGGTC CAAGGCCACC TACATTTACA TGAAGGCCGC CTACCTCAGC 1080 
ATGTTTGGGA AGGAGGACCA CAAGCCGTTC GGGGACGACG AAGTGGAATT ATTTCGAGCT 1 140 
GTGCCAGGCC TGAAGCTCAA GATTGCTGGG AAATCTCTAC CCACAGAGAA GTTTGCCATC 1200 

75 CGGAAGTCCC GGCGCTACTT CTCCTCCAAC CCTATCTCGC TGCCAGTGCC TGCTCTGGAA 1260 
ATGATGTACA TCTGGAACGG CTACGCCGTG ATTGGG AAGC AGCCGAAACT CACGGATGGG 1320 
ATACTTGAGA TTATCACTAA GGCTGAAGAG ATGCTGGAGA AAGGCCCAGA GAACGAGTAC 1380 
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TCAGTGGATG ACGAGTGCTT GGTGAAATTG TTOAAAGGCC TGTGTCTGAA ATACCTGGGC 1440 
CGTGTCCAGG AGGCCGAGGA GAATTTTAGG AGCATCTCTG CCAATGAAAA GAAGATTAAA 1500 
TATGACCACT ACTTGATCCC AAACGCCCTG CTGGAGCTGG CCCTGCTGCT TATGGAGCAA 1 550 
G ACAGAAACG AAGAGGCCAT CAAACTTTTG GAATCTGCCA AGCAAAACTA CAAGAATTAC 1620 
TCCATGG AGT CAAGG ACACA CTTTCG AATC CAGGCAGCCA CACTCCAAGC CAAOTCTTCC 1 680 
CTAG AGAACA GCAGCAGATC CATG GTCTCA TCAGTGTCCT TGTAGCTTTG TGCAGCAGTT 1 740 
CCGGGCTGGA AGACAGAGAC AGCTGGACAG AGCTCCTGAA AACATTTCAA AATACCCCCT 1800 
CCCCCTGCCC TGCCCTGCCT TTGGGGTCCA CCGGCACTCC AGTTGG ATGG CACA AC ATAG 1 860 
TGTATCCGTG CAGAAGCCGA GCTGGCATTT TCACCAGTGT AGCCAAGGGC CTTTGCCAAG 1920 
GGCAGAGCAG GTGGAGCCCT CTGCCTGCCC TATCACACAT ACGGGTACTT GCTTTTCACT 1980 
GTGATGTTTA AGAGAATGTA TGAACAGTTT ACATTTTCCT TAGAAATACA 1TGATGGGAT 2040 
CAOACriTGGC TfTAAAAACC A ACAACA ATC AACCACCTGT AAGTCTTTGT CTTCACCTAT 2100 
TATCATCTGG AGGTAAATCT CTTTATATGATGATGCCAAA GGGCAAATTG CTTTTCAAAT 2160 
TCAGCAAGTT CTCAGCTTGT GTGACGGAAG GTCCTTCAGA GGACCTGAGG AATGCCTGGG 2220 
AGAGGCTAAG CCTCAGGCTT CAATGCTTCT GGGGTTGGGC ATGAGG ATGT ACACAGACAC 2280 
CCACTACCTT ACTACTCACA CTTCATTTCA CTCCTTTTGT AAATTTCCAA TTTAAAAATC 2340 
A AGCACGTCT TTTTAGTG AG ATAAAATCTG AGCTCTTCTG TAGAAAAATC AATCTCTACC 2400 
AGTAGAA AAT GCCAGGGCTT G ATG GAAGAG CTGTGTAGCC CTTTCTATGC CAA AGCCAGG 2460 
AAATTTGGGG GGCAGGAGGA GGTTCTCAGA ATCCAGTCTG TATCTTTGCT GTATGCCAAA 2520 
CTGAAACCAC TGGGAATAAT TTATGAAACA TAAAAATCTT CTGTACTTCA CTCCAAGGTA 2580 
CATTTGCTTA CTGACAGCAT TTTTGTTAAA ACTGTTATTC TTGAA A A AAA AAAAA AA AAA 2640 
AA 

Seq ID NO: 56 Protein sequence: 
Protein Accession * AAC39582. 1 

1 11 21 31 4] 51 

MTALDLFLTN QFSEAISYLK PRTKESMYHS LTYAT1LEMQ AMMTFDPQDI LLAGNMMKEA 60 
QMLCQRHRRK SSVTDSFSSL VNRPTLGQFT EEEIHAEVCY AECLLQRAAL TFLQDENMVS [20 
FIKGGKVRN SYQTYKELDS LVQSSQYCKG ENHPHFEGGV KXGVOAFNLT LSMLPTRILR 1 80 
LLEFVGFSGN KDYGLLQLEE GASGHSFRSV LCVMLLLCYH TFL.TFVLGTG NVNIEEAEKL 240 
LKPYLNRYPK GAIFLFFAGR IEV1KGNIDA AIRRFEECCE AQQHWKQFHH MCYWELM WCF 300 
TYKGQWKMSY FYADLLSKEN CWSKATYIYM KAAYLSMFGK EDHKPFGDDE VELFRAVPGL 360 
KUCiAGKSLP TEKFAIRKSR RYFSSNP1SL PVPALEMMYI WNGYAVIGKQ PKLTDGILE! 420 
ITKAEEMLEK GPENEYSVDD ECLVKLLKGL CLKYLGRVQE AEENFRS1SA NEKKIKYDHY 480 
LIPNALLELA LLLMEQDRNE EA1KLLESAK QNYKNYSMES RTHFRIQAAT LQAKSSLENS 540 
SKSMVSSVSL 

Seq ID NO; 57 DNA sequence 

Nucleic Acid Accession #: NM_006670.1 

Coding sequence: 1-927 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

ATGCCTGGGG GGTGCTCCCO GGGCCCCGCC GCCGGGGACG GGCGTCTGCG GCTGGCGCGA 60 
CTAGCGCTGG TACTCCTGGG CTGGGTCTCC TCGTCTTCTC CCACCTCCTC GGCATCCTCC 120 
TTCTCCTCCT CGGCCCCGTT CCTGGCTTCC GCCGTGTCCG CCCAGCCCCC GCTCCCGG AC 1 80 
CAGTGCCCCG CGCTGTGCGA GTGCTCCGAG GCAGCGCGCA CAGTCAAGTG CGTTAACCGC 240 
AATCTGACCG AGGTGCCCAC GGACCTGCCC GCCTACGTGC GCAACCTCTT CCTTACCGGC 300 
AACCAGCTGG CCAGCAACCA CTTCCTTTAC CTGCCGCGGG ATGTGCTGGC CCAACTGCCC 360 
AGCCTCAGGC ACCTGGACTT AAGTAATAAT TCGCTGGTGA GCCTGACCTA CGTGTCCTTC 420 
CGCAACCTGA CACATCTAGA AAGCCTCCAC CTGGAGGACA ATGCCCTCAA GGTCCTTCAC 480 
AATGGCACCC TGGCTGAGTT GCAAGGTCTA CCCCACATTA GGGTTTTCCT GGACAACAAT 540 
CCCTGGGTCT GCGACTGCCA CATGGCAGAC ATGGTGACCT GGCTCAAGGA AACAGAGGTA 600 
GTGCAGGGCA AAGACCGGCT CACCTGTGCA TATCCGGAAA AAATGAGGAA TCGGGTCCTC 660 
TTGGAACTCA ACAGTGCTGA CCTGGACTGT GACCCGATTC TTCCCCCATC CCTGCAAACC 720 
TCTTATGTCT TCCTGGGTAT TGTTTTAGCC CTGATAGGCG CTATTTTCCT CCTGGTTTTG 780 
TATTTGAACC GCAAGGGOAT AAAAAAGTGG ATGCATAACA TCAGAGATGC CTGCAGGGAT 840 
CACATGGAAG GGTATCATTA CAGATATGAA ATCAATGCGG ACCCCAGATT AACAAACCTC 900 
AGTTCTAACT CGGATGTCCT CGAGTGA 

Seq ID NO: 58 Pmtein sequence 
Protein Accession H: NPJ106S6U 



1 11 21 31 41 51 

MPGGCSRGPA AGDGRLRLAR LALVLLGWVS SSSPTSSASS FSSSAPFLAS AVSAQPPLFD 60 
QCPALCECSE AARTVKCVNR NLTEVPTDLP AYVRNLFLTG NQLASNHFLY LPRDVLAQLP 120 
SLRHLDLSNN SLVSLTYVSF RNLTHLESLH LEDNALKVLH NGTLAELQGL PH1RVFLDNN 180 
PWVCDCHMAD MVTWLKETEV VQGKDRLTCA YPEKMRNRVL LELNSADLDC DPILPPSLQT 240 
SYVFLGrVLA LIGAIFLLVL YLNRKGKKW MHNIRDACRD HMEGYHYRYE INADPRLTNL 300 
SSNSDVLE 



Seq ID NO: 59 DNA sequence 

Nucleic Acid Accession*: NM_024022 
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Coding sequence: l-1362(underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
- I I I I I I 

J ATGGGGGAAA ATGATCCGCC TGCTGTTQAA GCCCCCTTCT CATTCCOATC GCTTTTTGGC 60 
CTTGATG ATT TGAAAATAAG TCCTGTTGCA CCAGATGCAG ATGCTGTTGC TGCACAGATC 1 20 
CTGTCACTGC TGCCATTG AA GTTTTTTCCA ATCATCGTCA TTGGG ATCAT TGCATTGATA 1 80 
TTAGCACTGG CCATTGGTCT GGGCATCCAC TTCG ACTGCT CAGGGAAGTA CAG ATGTCGC 240 
TCATCCTTTA AQTGTATCGA GCTGATAGCT CGATGTGACG GAGTCTCGGA TTGCAAAGAC 300 

1 0 GGGGAGGACG AGTACCGCTG TGTCCGGGTG GGTGGTCAGA ATGCCGTGCT CCAGGTUTTC 360 
ACAGCTGCrr CGTGGAAGAC CATGTGCTCC GATGACTGGA AGGGTCACTA CGCAAATGTT 420 
GCCTGTuCCC AACTGGGTTT CCCAAGCTAT GTGAGTTCAG ATAACCTCAC AGTGACiCTCG 480 
CTGGAGGGGC AGTTCCGGGA GGAGTTTGTG TCCATCGATC ACCTCTTGCC AGATGACAAG 540 
GTG ACTGCAT TACACCACTC AGTATATGTG AGGG AGGQAT QTGCCTCTGG CCACGTGGTT 600 

1 5 ACCTTGCAGT GCACAGCCTG TGGTCATAGA AGGGGCTACA GCTCACGCAT CGTGGGTGGA 660 
AACATGTCCT TGCTCTCGCA GTGGCCCTGG CAGGCCAGCC TTCAGTTCCA GGGCTACCAC 720 
CTGTGCGGGG GCTCTGTCAT CACGCCCCTG TGGATCATCA CTGCTGCACA CTGTGTTTAT 780 
GACTTGTACC TCCCCAAGTC ATGGACCATC CAGGTGGGTC TAGTTTCCCT GTTGGACAAT 840 
CCAGCCCCAT CCCACTTGGT GGAGAAGATT GTCTACCACA GCAAGTAC AA GCCAAAG AGG 900 

20 CTGGGCAATG ACATCGCCCT TATGAAGCTG GCCGGGCCAC TCACGTTCAA TQAAATGATC 960 
CAGCCTGTGT GCCTGCCCAA CTCTGAAGAG AACTTCCCCG ATGGAAAAGT GTGCTGGACG 1020 
TCAGGATGGG GGGCCACAGA GGATGGAGGT GACGCCTCCC CTGTCCTGAA CCACGCGGCC 1080 
GTCCCTTTGA TTTCCAACAA GATCTGCAAC CACAGGGACG TGTACGGTGG CATCATCTCC 1 140 
CCCTCCATGC TCTGCGCGGG CTACCTGACG GGTGGCOTGG ACAG CTGCCA GGGGGACAGC 1 200 

25 GGGGGGCCCC TGGTGTGTCA AGAGAGGAGG CTGTGGAAGT TAGTGGGAGC GACCAGCTTT 1 260 
GGCATCGGCT GCGCAGAGGT GAACAAGCCT GGGGTGTACA CCCGTGTCAC CTCCTTCCTG 1320 
GACTGGATCC ACGAGCAGAT GGAGAGAGAC CTAAAAACCTGA 

Sci] ID NO: 60 Pmtein sequence 
30 Protein Accession*: NP_076927 

1 11 21 31 41 51 

MGENDPPAVE APPSFRSLFG LDDLKISPVA PDADAVAAQI LSLLPLKFFP HVIGHALI 60 
35 LALAIGLGIH FDCSGKYRCR SSFKCIELIA RCDGVSDCKD GEDEYRCVRV GGQNAVLQVF 120 
TAASWKTMCS DDWKGHYANV ACAQLGFPSY VSSDNLRVSS LEGQFREEFV SIDHLLPDDK 1 80 
VTALHHSVY V REGCASGHVV TLQCTACGHR RGYSSR1VGG MMSLLSQWPW QASLQFQGYH 240 
LCGGSVITPL W1ITAAHCVY DLYLPKSWTI QVGLVSLLDN PAPSHLVEKI VYHSKYKPKR 300 
LGND1 ALMKL AGPLTFNEMI QPVCLPNSEE NFPDGKVCWT SGWG ATEDGG DASPVLNHAA 360 
40 VPLISNKICN HRDVYGGIIS PSMLCAGYLT GGVDSCQGDS GGPLVCQERR LWKLVGATSF 420 
GIGCAEVNKP GVYTRVTSFL DWIHEQMERD LKT 

Seq ID NO: 61 DNA sequence 
Nucleic Acid Accession H: NMJ06475 
45 Coding sequence: 28-2538 (underlined .-sequences correspond to start arid stop codoas) 

I 11 21 31 41 51 

I ! I 1 I i 

AACAG AACTG CAACGGAG AG ACTCAAGATG ATTCCCTTTT TACCCATGTT TTCTCTACTA 60 

5 0 TTGCTGCTTA TTGTTAACCC T ATAAACGCC A ACAATCATT ATG ACAAG AT CTTGGCTCAT 1ZO 
AGTCGTATCA GGGGTCGGGA CCAAGGCCCA AATGTCTGTG CCCTTCAACA GATTTTGGGC 1 80 
ACCAAAAAGA AATACTTCAG CACTTGTAAG AACTGGTATA AAAAGTCCAT CTGTGGACAG 240 
AAAACGACTG TTTTATATGA ATGTTGCCCT GGTTATATGA GAATGGAAGG AATGAAAGGC 300 
TGCCCAGCAG TTTTGCCCAT TGACCATGTT TATGGCACTC TGGGCATCGT GGGAGCCACC 360 

5 5 ACAACGCAGC GCTATTCTGA CGCCTCAAAA CTGAGGGAGG AGATCGAGGG AAAGGGATCC 420 
TTCACTTACT TTGCACCG AG TAATGAGGCT TOGGACAACT TGGATTCTGA TATCCGTAGA 480 
GGTTTGGAGA GCAACGTGAA TGTTGAATTA CTGAATGCTT TACATAGTCA CATGATTAAT 540 
AAGAG AATGT TGACCAAGG A CTTAA A AAAT GGCATGATTA TTCCTTCAAT GTATA ACAAT 600 
TTGGGGCTTT TCATTAACCA TTATCCTAAT GGGGTTGTC A CTGTTAATTG TGCTCGAATC 660 

60 ATCCATGGGA ACCAGATTGC AACAAATGGT GTTGTCCATG TCATTGACCG TGTGCTTACA 720 
CAAATTGGTA CCTCAATTCA AGACTTCATT GAAGCAGAAG ATGACCTTTC ATCTTTTAGA 7S0 
GCAGCTGCCA TCACATCGGA CAT ATTGGAG GCCCTTCGAA GAGACGGTCA CTTCACACTC 840 
TTTGCTCCCA CCAATGAGGC TTTTGAGAAA CTTCCACGAG GTGTCCTAGA AAGGTTCATG 900 
GGAGACAAAG TGGCTTCCGA AGCTCTTATG AAGTACCACA TCTTAAATAC TCTCCAGTGT 960 

65 TCTGAGTCTA TTATGGGAGG AGCAGTCTTT GAGACGCTGG AAGGAAATAC AATTGAGATA 1020 
GGATGTGACG GTGACAOTAT AACAGTAAAT GGAATCAAAA TGGTGAACAA AAAGGATATT 1080 
GTQACAAATA ATGGTGTGAT CCATTTGATT G ATCAGGTCC TAATTCCTGA TTCTGCCAAA 1 140 
CAAGTTATTG AGCTGGCTGG AAAACAGCAA ACCACCTTCA CGGATCTTGT GGCCCAATTA 1200 
GGCTTGGCAT CTGCTCTGAG GCCAGATGGA GAATACACTT TGCTGGCACC TGTGAATAAT 1260 

70 GCATTTTCTG ATGATACTCT CAGCATGGTT CAGCGCCTCC TTAAATTAAT TCTGCAGAAT 1320 
CACATATTGA AAGTAAAAGT TGGCCTTAAT GAGCTTTAC A ACGGGCAAAT ACTGGAAACC 1380 
ATCGGAGGCA AACAGCTCAG AGTCTTCGTA TATCGTACAG CTGTCTGCAT TGAAAATTCA 1440 
TGCATGGAGA AAGGGAGTAA GCAAGGGAGA AACGGTGCGA TTCACATATT CCGCGAGATC 1500 
ATCAAGCCAG CAGAGAAATC CCTCCATGAA AAGTTAAAAC AAGATAAGCG CTTTAGCACC 1560 

75 TTCCTCAGCC TACTTGAAGC TGCAGACTTG AAAGAGCTCC TGACACAACC TGGAGACTGG 1620 
ACATTATTTG TGCCAACCAA TGATGCTTTT AAGGGAATGA CTAGTGAAGA AAAAGAAATT 1680 
CTGATACGGG ACAAAAATGC TCTTCAAAAC ATCATTCTTT ATCACCTGAC ACCAGGAGTT 1740 
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TTCATTGGAA AAGGATTTGA ACCTGGTGTT ACTAACATTT TA AAGACCAC ACAAGGAAGC 1800 
AA AATCTTTC TGAAAGAAGT AA ATGATACA CTTCTGCTG A ATGAATTGAA ATCAA A AG AA 1 860 
TCTGACATCA TGACAACAAA TGOTGTAATT CATGTTGTAG ATAAACTCCT CTATCCAGCA 1920 
GACACACCTG TTGG AAATG A TCAACTGCTG 0 AA ATACTTA ATAAATTAAT CAAATACATC 1 980 
CAAATTAAGT TTGTTCGTGG TAGCACCTTC AAAGAAATCC CCGTGACTGT CTATACAACT 2040 
AAAATTATAA CCAAAGTTGT GGAACCAAAA ATTAAAGTGA TTGAAGGCAG TCTTCAGCCT 2100 
ATTATCAAAA CTG AAGGACC CACACTAACA AAAGTCAA A A TTG AAGGTG A ACCTG AATTC 21 60 
AGACTGATTA AAGAAGGTGA AACAATAACT GAAGTGATCC ATGGAGAGCC AATTATTAAA 222D 
AAATACACCA AAATCATTGA TGGAGTGCCT GTGGAAATAA CTGAAAAAGA GACACGAGAA 2280 
GAACGAATCA TTACAGGTCC TGAAATAAAA TACACTAGGA TTTCTACTGG AGGTGGAGAA 2340 
ACAGAAGAAA CTCTGAAGAA ATTGTTACAA GAAGAGGTCA CCAAGGTCAC CAAATTCATT 24C0 
GAAGGTGGTG ATGGTCATTT ATTTGAAGAT GAAGAAA1TA AAAGACTGCT TCAGGGACfAC 2460 
ACACCCGTGA GGAAGTTGCA AGCCAACAAA AAAGTTCAAG GTTCTAGAAG ACGATTAAGG 2S20 
GAAGGTCGTT CTCAGTGAAA ATCCAAAAAC CAGAAAAAAA TGTTTATACA ACCCTAAGTC 25 BO 
AATAACCTGA CCTTAGAAAA TTGTGAGAGC CAAGTTGACT TCAGGAACTG AAACATCAGC 2640 
ACAAAGAAGC AATCATCAAA TAATTCTGA A CACAAATTTA ATATTTTTTT TTCTGAATGA 2700 
GAAACATGAG GGAAATTGTG GAGTTAGCCT CCTGTGGTAA AGGAATTGAA GAAAATATAA 2760 
CACCTTACAC CCTTTTTCAT CTTGACATTA AAAGTTCTGG CTAACTTTGG AATCCATTAG 2820 
AG AAAAATCC TTGTCACCAG ATTCATTACA ATTCAAATCG AAGAGTTGTG AACTGTTATC 2880 
CCATTGAAAA G ACCG AGCCT TGTATGTATG TTATGGATAC ATAAAATGCA CGCAAGCCAT 2940 
TATCTCTCCA TGGGAAGCTA AGTTATAAAA ATAGGTGCTT GGTGTACAAA ACTTTTTATA 3000 
TCAAAAGGCT TTGCACATTT CTATATGAGT GGGTTTACTG GTAAATTATG TTATTTTTTA 3060 
CAACTAATTT TGTACTCTCA GAATGTTTGT CATATGCTTC TTGCAATGCA TATTTTTTAA 3120 
TCTC AAACGT TTCAATAAA A CCATTTTTCA G ATATAAAG A GA ATTACTTC AAATTG AGTA 3180 
ATTCAGAAAA ACTCAAGATT TAAGTTAAAA AGTGGTTTGG ACTTGGGAA 

Seq ID NO: 62 Protein gequerce: 
Protein Accession II: NP_0D6466 

1 11 21 31 41 5i 
I 1 I I I I 

MIPFLPMFSL LLLL1VNPIN ANNHYDKILA HSRIRGRDQG PNVCALQQ1L GTKKKYFSTC 60 
KNWYKKSICG QKTTVLYECC PGYMRMEGMK GCPAVLPIDH VYGTLGIVGA TTTQRYSDAS 120 
KLREEIEGKG SFTYFAPSNE AWDNLDSDIR RGLESNVNVE LLNALHSHMI NKRMLTKDLK i 80 
NGMIIPSMYN NLGLFINHYP NG VVTVNCAR IIHGNQIATN GVVHVIDRVL TQIGTSIQDF 240 
EAEDDLSSF RAAA1TSDIL EALGRDGHFT LFAFTNEAFE KLPRGVLERF MGDKVASEAL 300 
MKYHILNTLQ CSES1MGGAV FETLEGNTIE FGCDGDS1TV NGIKMVNKKD IVTNNGVIHL 360 
IDQVLIPDSA KQVIELAGKQ QTTFTDLVAQ LGLAS ALRPD GEYTLLAPVN NAFSDDTLSM 420 
VQRLLKULQ NHILKVKVGL NELYNGQILE T1GGKQLRVF VYRTAVCIEN SCMEKGSKQG 480 
RNGA1HIFRE IIKPAEKSLH EKLKQDKKFS TFLSLLEAAD LKEIXTQPGD WTLFVPTNDA S40 
FKGMTSEEKE ILIRDKNALQ NI1LYHLTPG VF1GKGFEPG VTNILKTTQG SKIFLKEVND 600 
TLLVNELKSK ESDMTTNGV IHWDKLLYP ADTFVGNDQL LEILNKUKY IQIKFVRGST 660 
FKEIPVTVYT TKIITKVVEP KOiVlEGSLQ PIDCTEGPTL TKVKIEGEPE FRUKEGETI 720 
TEVIHGEPD KKYTKI1DGV PVEITOKETR EER1ITGPEI KYTRISTGGG ETEETLKKLL 780 
QHHVTKVTKF IEGGDGHLFE DEE1KRLLQG DTPVRKLQAN KKVQGSRRRL REGRSQ 

Seq ID NO: 63 DNA sequence 

Nucleic Acid Accession*: NM 020974 

Coding sequence: B1-308D (underlined sequences correspond lo start and stop codons) 

1 11 21 31 41 51 
I I I I I 1 

GGCGTCCGCG CACACCTCCC CGCGCCGCCG CCGCCACCGC CCGCACTCCG CCGCCTCTGC 60 
CCGCAACCGC TGAGCCATCC ATGGGGGTCG CG GGCCGCAA CCGTCCCGGG GCGGCCTGGG 120 
CGGTGCTGCT GCTGCTGCTG CTGCTGCCGC CACTGCTGCT GCTGGCGGGG GCCGTCCCGC 180 
CGGGTCGGGG CCGTGCCGCG GGGCCGCAGG AGGATGTAGA TGAGTGTGCC CAAGGGCTAG 240 
ATGACTGCCA TGCCGACGCC CTGTGTCAGA ACACACCCAC CTCCTACAAG TGCTCCTGCA 3C0 
AGCCTGGCTA CCAAGGGGAA GGCAGGCAGT GTGAGGACAT CGATGAATGT GGAAATGAGC 360 
TCAATGGAGG CTGTGTCCAT GACTGTTTGA ATATTCCAGG CAATTATCGT TGCACTTGTT 420 
TTGATGGCTT CATGTTGGCT CATG ACGGTC ATAATTGTCT TGATGTGGAC GAGTGCCTGG 480 
AOAACAATG0 CGOCTGCCAG CA'i'ACCTGl'G TCAACGTCAT GGGGAGCTAT GAGTGCTGC1' 540 
GCAAGOAGGG arnTTC^rG AGTGACAATC AtiCACACCTO CATl'CACCGC TCGGAAGAGG 600 
GCCTGAGCTG CATGAATAAG GATCACGGCT GTAGTCACAT CTG CA AG G A G GCCCCAAGGG 660 
GCAGCGTCGC CTGTGAGTGC AGGCCTGGTT TTGAGCTGGC CAAGAACCAG AGAGACTGCA 720 
TCTTGACCTG TAACCATGGG AACGGTGGGT GCCAGCACTC CTGTGACGAT ACAGCCGATG 780 
GCCCAGAGTG CAGCTGCCAT CCACAGTACA AGATGCACAC AGATGGGAGG AGCTGCCTTG 840 
AGCGAGAGGA CACTGTCCTG GAGGTGACAG AGAGCAACAC CACATCAGTG GTGGATGGGG 900 
ATAAACGGGT G AAACGGCGG CTG CTCATGG AAACGTGTGC TGTCAACAAT GG AG GCTGTG 960 
ACCGCACCTG TAAGG ATACT TCGACAGGTG TCCACTGCAG TTGTCCTGTT GG ATTCACTC 1 020 
TCCAGTTGGA TGGGAAGACA TGTAAAGATA TTGATGAGTG CCAGACCCGC AATGGAGGTT 1080 
GTGATCATTT CTGCAAAAAC ATCGTGGGCA GTTTTG ACTG CGGCTGCAAG A AAGGATTTA 1 140 
AATTATTAAC AGATGAGAAG TCTTGCCAAG ATGTGGATGA GTGCTCTTTG GATAGGACCT 1200 
GTGACCACAG CTGCATCAAC CACCCTGGCA CATTTGCTTG TGCTTGCAAC CGAGGGTACA 1260 
CCCTGTATGG CTTCACCCAC TGTGGAGACA CCAATGAGTG CAGCATCAAC AACGGAGGCT i320 
GTCAGCAGGT CTGTGTGAAC ACAGTGGGCA GCTATGAATG CCAGTGCCAC CCTGGGTACA 1380 
AGCTCCACTG GAATAAAAAA GACTGTGTGG AAGTGAAGGG GCTCCTGCCC ACAAGTGTGT 1440 
CACCCCGTGT GTCCCTGCAC TGCGGTAAGA GTGGTGGAGG AGACGGGTGC TTCCTCAGAT 1500 
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GTCACTCTGG CATTCACCTC TCTTCAG ATG TCACCACCAT CAGGAC AAGT GTAACCTTTA 1 560 
AGCTAAATGA AGGCAAGTGT AGTTTGAAAA ATGCTGAGCT GTTTCCCGAG GGTCTGCGAC 1620 
CAGCACTACC AGAGAAGCAC AGCTCAGTAA AAGAGAGCTT CCGCTACGTA AACCTTACAT 1 630 
GCAGCTCTGG CAAGCAAGTC CCAGGAGCCC CTGGCCGACC AAGCACCCCT AAGGAAATGT 1740 
5 TTATCACTGT TGAGTTTGAG CTTG A AACTA ACCA AAAGGA GGTGACAGCT TCTTGTGACC 1 800 
TGAGCTGCAT CQTAAAGCGA ACCGAGAAGC GGCTCCGTAA AGCCATCCGC ACGCTCAGAA I860 
AGGCCGTCCA CAGGGAGCAG TTTCACCTCC AGCTCTCAGG CATGAACCTC GACGTGGCTA 1920 
AAAAGCCTCC CAGAACATCT GAACGCCAGG CAGAGTCGTG TGGAGTGGGC CAGGGTCATG 198D 
CAG AAAACCA ATGTGTCAGT TGC AGGGCTG GGACCTATTA TGATGG AGCA CGAGAACGCT 2040 

1 0 GCATTTTATG TCCAAATGGA ACCTTCCAAA ATGAGGAAGG ACAAATGACT TGTGAACCAT 2100 
GCCCAAGACC AGGAAATTCT GGGGCCCTGA AGACCCCAGA AGCTTGGAAT ATGTCTGAAT 2160 
GTCiGAGG'i'Cl OTGTCAACCT GGTOAATATT CTOCAGATGG CTTTGCAOCT TGCCAGCTCT 2220 
GTGCCCTGGG CACGTTCCAG CCTGAAGCTG GTCGAACTTC CTGCTTCCCC TGTGGAGGAG 2280 
GCCTTGCCAC CAAACATCAG GGAGCTACTT CCTTTCAGGA CTGTGAAACC AGAGTTCAAT 2340 

1 5 GTTCACCTGG ACATTTCTAC AACACCACCA CTCACCGATG TATTCGTTGC CCAGTGGGAA 2400 
CATACCAGCC TGAATTTGGA AAAAATAATT GTGTTTCTTG CCCAGGAAAT ACTACGACTG 2460 
ACTTTGATGG CTCCACAAAC ATAACCCAGT GTA A AA ACAG AAGATGTGGA GGGGAGCTGG 2520 
GAGATTTCAC TGGGTACATT GAATCCCCAA ACTACCCAGG CAATTACCCA GCCAACACCG 2580 
AGTGTACGTG GACCATCAAC CCACCCCCCA AGCGCCGCAT CCTGATCGTG GTCCCTGAGA 2640 

20 TCTTCCTGCC CATAGAGGAC GACTGTGGGG ACTATCTGGT GATGCGGAAA ACCTCTTCAT 2700 
CCAATTCTGT GACAACATAT GAAACCTGCC AGACCTACGA ACGCCCCATC GCCTTCACCT 2760 
CCAGGTCAAA GAAGCTGTGG ATTCAGTTCA AGTCCAATG A AGGGAACAGC GCTAGAGGGT 2S20 
TCCAGGTCCC ATACGTGACA TATGATGAGG ACTACCAGGA ACTCATTGAA GACATAGTTC 2880 
GAGATGGCAG GGTCTATGCA TCTGAGAACC ATCAGGAAAT ACTTAAGGAT AAGAAACTTA 2940 

25 TCAAGGCTCT GTTTGATGTC CTGGCCCATC CCCAGAACTA TTTCAAGTAC ACAGCCCAGG 3000 
AGTCCCGAGA GATGTTTCCA AGATCGTTCA TCCGATTGCT ACGTTCCAAA GTGTCCAGGT 3060 
TTTTGAGACC TTACAAATGA CTCAGCCCAC GTGCCACTCA ATACAAATGT TCTGCTATAG 3120 
GGTTGGTGGG ACAGAGCTOT CTTCCITCTG CATGTCAGCA CAGTCGGGTA TTGCTGCCTC 3180 
CCGTATCAGT GACTCAT1 AG AUTiCAATTT Tl'ATAGATAA TACAGATATT TTGGTAAATT 3240 

3 0 GAACTTGGTT TTTCTTTCCC AGCATCGTGG ATGTAGACTG AGAATGGCTT TGAGTGGCAT 3300 
CAGCTTCTCA CTGCTGTGGG CGGATGTCTT GGATAGATCA CGGGCTGGCT GAGCTGGACT 3360 
TTGGTCAGCC TAGGTGAGAC TGACCTGTCC TTCTGGGGTC TTACTCCTCC TCAAGGAGTC 3420 
TGTAGtGGAA AGGAGGCCAC AGAATAAGCT GCTTATTCTG AAACTTCAGC TTCCTCTAGC 3480 
■ CCGGCCCTCT CTAAGGGAGC CCTCTGCACT CGTGTGCAGG CTCTGACCAG GCAGAACAGG 3540 

35 CAAGAGGGGA GGGAAOGAGA CCCCTGCAGG CTCCCTCCAC CCACCTTGAG ACCTGGGAGG 3 COO 
ACTCAGTTTC TCCACAG CCT TETCCAGCCT GTGTG ATACA AGTTTGATCC CAGGAACTTG 3660 
AGTTCTAAGC AGTGCTCGTG AAAAAAAAAA GCAGAAAGAA TTAGAAATAA ATAAAAACTA 3720 
AGCACTTCTG GAGACAT 

40 Sea ID NO: 64 Protein sequence: 

Protein Accession #: NPJI66025. 1 

1 11 21 31 41 51 

45 MGVAGRNRPG AAWAVLLLLL LLPPLLLLAG AVPFGRGRAA GPQEDVDECA QGLDDCHADA 60 
LCQNTPTSYK CSCKPGYQGB (jRCJC'EDlDEC GNKLNGGCVH DCLNIPGNYR CTCFDGFMLA 120 
HDGHNCLDVD ECLENNGGCQ HTCVNVMGSY ECCCKBGFFL 31 i r > < < r ,r> rrGLSCMNK 180 
DHGCSHICKE APRGSVACEC RPOFELAKNQ RDCILTCNHG NGGCQHSCDD TADGPECSCH 240 
PQYKMHTDGR SCLEREDTVL EVTESNTTSV VDGDKRVKRR LLMETCAVNN GGCDRTCKDT 300 

50 STGVHCSCP V GFTLQLDGKT CKDIDECQTR NGGCDHFCKN IVGSFDCGCK KGFKLLTDEK 360 
SCQDVDECSL DRTCDHSCIN HPGTFACACN RGYTLYGFTH CGDTNECSIN NGGCQQVCVN 420 
TVGSYECQCH PGYKLHWNKK DCVEVKGLLP TSVSPRVSLH CGKSGGGDGC FLRCHSGIHL 480 
SSDVTTIRTS VTFKLNEGKC SLKNAELFPE GLRPALPEKH SSVKESFRYV NLTCSSGKQV 540 
PGAPGRPSTP KEMFITVEFE LETNQKEVTA SCDLSCIVKR TEKRLRKAIR TLRKAVHREQ 600 

55 FHLQLSGMNL DVAKKFPRTS ERQAESCGVG QGHAENQCVS CRAGTYYDGA RERCILCPNG 660 
TFQNEEGQMT CEPCPRPGNS GALKTPEAWN MSECGGLCQP GEYSADGFAP CQLCALGTFQ 720 
PEAGRTSCFP CGGGLATKHQ GATSFQDCET RVQCSPGHFY NTTTHRCIRC PVGTYQPEFG 780 
KNNCVSCPGN TTTDFDGSTN 1TQCKNRRCG GELGDFTGYI ESPNYPGNYP ANTECTWTM 840 
PPPBCRRILiV VPEIFLP1ED DCGDYLVMRK TSSSNSVTTY ETCQTYERPI AFTSRSKKLW 900 

60 IQFKSNEGNS ARGFQVPYVT YDEDYQELIE D1VRDGRLYA SENHQEILKD KKLKALFDV 960 
LAHPQNYFKY TAQESREMFP RSFIRLLRSK VSRFLRPYK. 

Seq ID NO: 65 DNA sequence 
Nucleic Acid Accession*: NM.007210 
65 Coding sequence: 1-1869 (underlined sequences correspond to sunt and stnp codons) 

1 11 21 31 41 51 

ATG AGGCTCC TCCGC AG ACG CCAC ATGCCC CTGCGCCTGG CCATGGTGGG CTGCGCCTTF 60 
70 GTGCTCTTCC TCTTCCTCCT GCATAGGGAT GTG AGCAGCA GAGAGGAGGC CACAGAGAAG 120 
CCGTGGCTGA AGTCCCTGGT G AGCCGG AAG GATCACGTCC TGGACCTCAT GCTGG AGGCC 1 80 
ATGAACAACC TTAGAGATTC AATGCCCAAG CTCCAAATCA GGGCTCCAGA AGCCCAGCAG 240 
ACTCTGTTCT CCATAAACCA GTCCTGCCTC CCTGGQTTCT ATACCCCAGC TGAACTGAAG 300 
CCCTTCTGGG AACGGCCACC ACAGGACCCC AATGCCCCTG GGGCAGATGG AAAAGCATTT 360 
75 CAGAAGAGCA AGTGGACCCC CCTGGAGACC CAGGAAAAGG AAGAAGGCTA TAAGAAGCAC 42D 
TGTTTCAATG CCTTTGCCAG CGACCGGATC 7CCCTGCAGA GGTCCCTGGG GCCAGACACC 480 
CGACCACCTG AGTGTGTGGA CCAGAAGTTC CGGCGCTGCC CCCCACTGGC CACCACCAGC 540 
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GTGATCATTG TGTTCCACAA CGAAGCCTG G TCCACACTGC TGCG AACAGT GTACAGCGTC 600 
CTACACACCA CCCCTGCCAT CTTGCTCAAG GAGATCATAC TGGTGGATGA TGCCAGCACA 660 
GAGGAGCACG TAAAGGAGAA GCTGGAGCAG TACGTGAAGC AGCTGCAGGT GGTGAGGGTG 720 
GTGCGGCAGG AGGAGCGGAA GGGGTTGATC ACCGCCCGGC TGCTGGGGGC CAGCGTGGCA 780 
5 CAGGCGGAGG TGCTCACGTT CCTGGATGCC CACTGTGAGT GCTTCCACGG CTGGCTGGAG 840 
CCCCTCCTGG CTCGAATCGC TGAGGACAAG ACAGTGGTGG TGAGCCCAGA CATCGTCACC 900 
ATCGACCTTA ATACTTTTGA GTTCGCCAAG CCCGTCCAGA GGGGCAGAGT CCATAGCCGA 960 
GGCA ACTTTG ACTGGAGCCT G ACCTTCGGC TGGGAAACAC TTCCTCCACA TGAGAAGCAG 1020 
AGGCGCAAGG ATGAAACATA CCCCATCAAA TCCCCGACGT TTGCTGGTGG CCTCTTCTCC 1080 

1 0 ATCCCCAAGT CCTACTTTGA GCACATCGG7 ACCTATGATA ATCAGATGGA GATCTGGGGA 1 140 
GGGGAGAACG TGGAAATGTC CTTCCGGGTG TGGCAGTGTG GGGGCCAGCT GGAGATCATC 1203 
CCCTGCTCTG TCGTAGGCCA TGTGTTCCGG ACCAAGAGCC CCCACACCTT CCCCAAGGGC ! 260 
ACTAGTGTCA TTGCTCGCAA TCAAGTGCGC CTCGCAOAGG TCTGGATGGA CAGCTACAAG 1320 
AAGATTTTCT ATAGGAGAAA TCTGCAGGCA GCAAAG ATGG CCCAAGAG AA ATCCTTCGGT 1380 

15 GACATTTCGG AACGACTGCA GCTGAGGGAA CAACTGCACT GTCACAACTT TTCCTGGTAC 1440 
CTGCACAATG TCTACCCAGA GATGTTTGTT CCTGACCTG A CGCCCACCTT CTATGGTGCC 1 500 
ATCAAGAACC TCGGCACCAA CCAATGCCTG GATGTGGGTG AGAACAACCG CGGGGGGAAG 1560 
CCCCTCATCA TGTACTCCTG CCACGGCCTT GGCGGCAACC AGTACTTTGA GTACACAACT 1620 
CAGAGGGACC TTCGCCACAA CATCGCAAAG CAGCTGTGTC TACATGTCAG CAAGGGTGCT 1 680 

20 CTGGGCCTTG GGAGCTGTCA CTTCACTGGC AAGAATAGCC AGGTCCCCAA GGACGAGGAA 1740 
TGGGAATTGG CCCAGGATCA GCTCATCAGG AACTCAGGAT CTGGTACCTG CCTGACATCC 1800 
CAGGACAAAA AGCCAGCCAT GGCCCCCTGC AATCCCAGTG ACCCCCATCA GTTGTGGCTC I860 
TTTGTCTAGG ACCCAGATCA TCCCCAGAGA GAGCCCCCAC AAGCTCCTCA GGAAACAGGA 1920 
TTGCTGATGT CTGGGAACCT GATCACCAGC TTCTCTGGAG GCCGTAAAGA TGGATTTCTA 1980 

25 AACCCACTGG GTGGCAAGGC AGGACCTTCC TAATCCTTGC AACAACATTG GGCCCATTTT 204O 
CTTTCCTTCA CACCGATGGA AGAGACCATT AGGACATATA TTTAGCCTAG CGTTTTCCTG 2100 
TTCTAGAAAT AGAGGCTGCC AAAGTAGGGA AGGCAGCTGG GGGAGGGTTC AGGGCAGCAA 2160 
TGCTGAGTTC AAGAAAAGTA CI TCAGGCTG GGCACAGTGG CTCATGCCTG AAATCCTAGC 2220 
ACTTTGGGAA GACAATGTGG GAGAATGGCT TGAGCCCAGG AGTTCAAGAC CGGCCTGAGC 22SD 

3 0 AACATAGTGA GGATCCCATC TCTACGCCCA CCCTCCCCCC GGCAAAAAAA AAAGCTGGGT 2340 
ATGGTGGCTT ATGCCTGTAG TCGCAGCTAC TCAGAAGGCT GAGGTGGGAG GATTGCTTGT 2400 
TCCCCGGAGG TTGAAGCTAC AGTGAGCCTT GATTGTGTCA CTGCACTCCA GCCTGGGCAA 2460 
CAGGTAAGAC TCTGTCTCAA AAAAAAAACA AAAAAGAAGA AGAAAAGTAC TTCTACAGCC 2520 
. ATGTCCTATT CCTTGATCAT CCAAAGCACC TGCAGAGTCC AGTGAAATGA TATATTCTGG 2580 

3 5 CTGGGCACAG TGGCTCACAC CTGTAATCCT AGCACTTTGG GAGGCCAAGG CAGGTGGATC 2640 
ACCTGAGGTC AGAAGTTTGA AACCAGCCTG GACTACATGG TGAAACTCCA TCTCTACTAA 2700 
AAGTACAAAA ATTAGCTGGG CATGATGGCA CGCACCTGCA GTCCCAGCTA CTTGGGAGGC 2760 
TGAGGCAGGA GAATCACTCG AACCCAGGAG GCAGAGGTTG CAGTGAGCCA AGACAGCACC 2820 
ATTGCACCCC AGCCTGAGCA ACAAGAGCGA AACTCCATCT CAGGAAAAAA AAAAAAAAAA 2880 

40 A 

Seq ID NO: 66 Protein sequence: 
Protein Accession ft NP.009141 

45 1 11 21 31 41 51 

I 1 I I I 1 

MRLLRRRHMP l-RWMVGCAK VLFLFLMMi) VSSRnfsATKK l'VV'J .KS1 .VSKK DHVLDLMLEA 60 
MNNLRDSMPK LQIRAPEAQQ TLFSINQSCL PGFYTPAELK PFWERPPQDP NAPGADGKAF 120 
QKSKWTPLET QEKEEGYKKH CFNAFASDR1 SLQRSLGPDT RPPECVDQKF RKCPPLATTS ISO 

50 VDVFHNEAW STLLRTVYSV LHTTPA1LLK EULVDDAST EEHLKEKLEQ YVKQLQVVRV 240 
VRQEERKGLI TARLLGAS VA QAEVLTFLDA HCECPHGWLE P1XARIAEDK TVVVSPDIVT 300 
IDLNTFEFAK PVQRGRVHSR GNFDWSLTFG WETLPPHEKQ RRKDETYPIK SPTFAGGLFS 360 
IPKSYFEHIG TYDNQMEIWG GENVEMSFRV WQCGGQLEII PCSVVGHVFR TKSPHTFPKG 420 
TSVIARMQVR LAEVWMDSYK K1FYRRNLQA AKM AQEK.SFG DISERLQLRE QLHCHNFSWY 450 

55 LHNVYPEMFV PDLTPTFYGA 1KNLGTNQCL DVGENNRGGK PL1MYSCHGL GGNQYFEYTT S40 
QRDLRHNIAK QLCLHVSKGA LGLGSCHFTG KNSQVPKDEE WELAQDQLIR NSGSGTCLTS 600 
QDKKFAMAPC NPSDPHQLWL FV 

Scq ID NO: 67 DNA sequence 
60 Nucleic Acid Accession ff: NM_0141 12 

Coding sequence: 600-44 84 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
,,11111) 

65 TTCCTCCGCG AAGGCTCCTT TGATATTAAT AGTGTTGGTG TCTTGAAACT OACGT AATGC 60 

GCGG AG ACTG AGGTCCTGAC AAGCGATAAC ATTTCTGATA AAGACCCGAT CTTACTGCAA 1 20 
TCTCTAGCGT CCTCTTTTTT GGTGCTGCTG GTTTCTCCAG ACCTCGCGTC CTCTCGATTG 1 80 
CTCTCTCGCC TTCCTATTTC TTTTTTTTTT TTTTAAACAA AAAACAACAC CCCCTCCCCT 240 
CTCCCACCCG GCACCGGGCA CATCCTTGCT CTATTTCCTT TCTCTTTCTC TCTCTCTCTC 300 

70 TCTCTTTTTT AATAAGGGTG GGGGAGGG AA AGGGGGGGGA GGCAGGAAAG ACCTTTTICT 360 
CTCCCCCCCG CAATAATCCA AGATCA ACTC TGCAAACAAC AGA AGACGGT TCATGGCTTT 420 
GGCCGCCGCG CCACCATCTT TCGGGCTGCC GAGG GTGTTC TTGAGG ATT A ATCAACAGAT 480 
GTACAGATCA GCTCTCAAAA TGTCTTCTGT GTCTTCTGAG CGTCTTCTAA GACAATTCCA 540 
TTAGCCTCCT GCTAGTTGAC TAATAG AATT AATAATTGTA AAAAGCACTC TAA AGCCACA 600 

75 TGCCTTATGA AGTCAATGCT GGGTATGATT TTACAAATAT GGTCCGGAAA AAGAACCCCC 660 
CTCTGAGAAA CGTTGCAAGT GAAGGCGAGG GCCAGATCCT GGAGCCTATA GGTACAGAAA 720 
GCAAGGTATC TGGAAAGAAC AAAGAATTCT CTGCAGATCA GATGTCAGAA AATACGGATC 780 
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AGAGTGATGC TGCACAACTA AATCATAAGG AGGAACATAG CTTGCATGTT CAAGATCCAT 840 
CTTCTAOCAG TAAGAAGGAC TTGAAAAGCG CAGTTCTGAG TGAGAAGGCT GGCTTCAATT 900 
ATGAAAGCCC CAGTAAGGGA GGAAACTTTC CCTCCTTTCC GCATGATGAG GTGACAGACA 960 
GAAATATGTT GGCTTTCTCA TTTCCAGCTG CTGGGGGAGT CTGTGAGCCC TTGAAGTCTC 1020 
CGCAAAGAGC AGAGGCAGAT GACCCTCAAG ATATGGCCTG CACCCCCTCA GGGGACTCAC 1080 
TGGAGACAAA GGAAGATCAG AAGATGTCAC CAAAGGCTAC AGAGGAAACA GGGCAAGCAC 1 140 
AG AGTGGTCA AGCCAATTGT CA AGGTTTG A GCCCAGTTTC AGTGGCCTCA AAAAACCCAC 1 200 
A AGTGCCTTC AGATGGGGGT GTAAGACTG A ATAAATCCA A AACTG ACTTA CTGGTG A ATG 1 260 
ACAACCCAGA CCCGGCACCT CTGTCTCCAG AGCTTCAGGA CTTTAAATGC AATATCTGTG 1320 
GATATGGTTA CTACGGCAAC QACCCCACAG ATCTGATTAA GCACTTCCGA AAGTATCACT 1380 
TAGGACTGCA TAAOCGCACC AGGCA AGATG CTGAGCTGGA CAGCAAAATC TTGGCCCTfC J 440 
ATAACATGGT GCAGTTCACJC CATTCCAAAG ACTTCCAGAA GGTCAACCGT TCTGTGTTTT 1 500 
CTGGTGTGCT GCAGGACATC AATTCTTCAA GGCCTGTTTT ACTAAATGGG ACCTATGATG 1560 
TGCAGGTGAC TTCAGOTGGA ACATTCATTG GCA7TGGACG GAAAACACCA GATTGCCAAG 1620 
GGAACACCAA GTATTTCCGC TGTAAATTCT GCAATTTCAC TTATATGGGC AACTCATCCA 1680 
CCGAATTAGA ACAACATTTT CTTCAGACTC ACCCAAACAA AATAAAAGCT TCTCTCCCCT 1740 
CCTCTGAG GT TGCAAAACCT TCAGAGAAAA ACTCTAACAA GTCCATCCCT GCACTTCAAT 1 S00 
CCAGTGATTC TGGAGACTTG GGAAA ATGGC AGG ACAAG AT AACAGTCAAA GCAGGAGATG 1860 
ACACTCCTGT TGGGTACTCA GTG CCCATAA AGCCCCTCGA TTCCTCTAG A CAAAATGGTA 1920 
CAGAGGCCAC CAGTTACTAC TGGTGTAAAT TTTGTAGTTT CAGCTGTGAG TCATCTAGCT 1980 
CACTTAAACT GCTAGAACAT TATGGCAAGC AGCACGGAGC AGTGCAGTCA GGCGGCCTTA 2040 
ATCCAGAGTT AAATGATAAG CTTTCCAGGG GCTCTGTCAT TAATCAGAAT GATCrAGCCA 2100 
AAAGTTCAGA AGGAGAGACA ATGACCAAOA CAGACAAGAG CTCGAGTGGG GCTAAAAAGA 2160 
AGGACTTCTC CAGCAAGGGA GCCGAGGATA ATATGGTAAC GAGCTATAAT TGTCAGTTCT 2220 
GTGACTTCCG ATATTCCAAA AGCCATGGCC CTGATGTAAT TGTAGTGGGG CCACTTCTCC 2280 
GTCATTATCA ACAGCTCCAT AACATTCACA AGTGTACCAT TAAACACTGT CCATTCTGTC 2340 
CC AG AGGACT TTGCAGCCCA GAAAAGCACC TTGGAGA AAT TACTTATCCG TTTGCTTGTA 2400 
GAAAAAGTAA TTOTTCCCAC TGTGCACTCT TGCTTCTGCA CTTGTCTCCT GGGCCGOCTG 2460 
( COCOA I t 1'CAGTGTT ( t 1 1 ' r I G -> 

TACTCCTCTTTCACTATGAA AGTGTGCATG AGTCCCAAGC ATCGGATGTC AAACAAGAAG 2580 
CAAATCACCT GCAAGGATCG GATGGGCAGC AGTCTGTCAA GGAAAGCAAA GAACACTCAT 2640 
GTACCAAATG TG ATTTTATT ACCCA AGTGO AAGA AG AG AT TTCCCGACAC TACAGG AG AG 2700 
CACACAGCTG CTACAAATGC CGTCAGTGCA GTTTTACAGC TGCCGATACT CAGTCACTAC 2760 
TGGAGCACTT CAACACTGTT CACTGCCAGG AACAGGACAT CACTACAGCC AACGGCGAAG 2S20 
AGGACGGTCA TGCCATATCC ACCATCAAAG AGGAGCCCAA AATTGACTTC AGGGTCTACA 2880 
ATCTGCTAAC TCCAGACTCT AAAATGGGAG AGCCAGTrTC TGAGAGTGTG GTGAAGAGAG 2940 
AGAAGCTGGA AGAGAAGGAC GGGCTCAAAG AGAAAGTTTG GACCGAGAGT TCCAGTGATG 3000 
ACCTTCGCAA TGTGACTTGG AGAGGGGCAG ACATCCTGCG GGGGAGTCCG TCATACACCC 3060 
AAGCAAGCCT GGGGCTGCTG ACGCCTGTGT CTGGCACCCA AGAGCAGACA AAGACTCTAA 3120 
GGGATAGTCC CAATGTGGAG GCCGCCCATC TGGCGCGACC TATTTATGGC TTGGCTGTGG 3 180 
AAACCAAGGG ATTCCTGCAG GGGGCGCCAG CTGGCGGAGA GAAGTCTGGG GCCCTCCCCC 3240 
AGCAGTATCC TGCATCGGGA GAAAACAAGT CCAAGGATGA ATCCCAGTCC CTGTTACGGA 3300 
GGCGTAGAGG CTCCGOTGTT TTTTGTGCCA ATTGCCTGAC CACAAAGACC TCTCTCTGGC 3360 
□AAAGAATGC AAATGGCGGA TATGTA'l'GCA ACGCGTGTGG C 2 r rACCM \ Gmi A„l 3420 
CG 1.1 TO L \G OG( UTAAAC Al'CA'lTAAAC AAAACAACGGTGAGCAGATTATTAGGAGGA 3480 
GAACAAGAAA GCGCCTTAAC CCAGAGGCAC TTCAGGCTGA GCAGCTCA* r \ CM 
GGGGCAGCAA TGAGGAGCAA GTCAATGGAA GCCCGTTAGA GAGGAGGTCA GAAGATCATC 3600 
TAACTGAAAG TCACCAG AGA GAAATTCCAC TCCCCAGCCT AAGTAAATAC GAAGCCCAGG 3660 
GTTCATTGAC TAAAAGCCAT TCTGCTCAGC AGCCAGTCCT GGTCAGCCAA ACTCTGGATA 3720 
TTCACAAAAG GATGCAACCT TTGCACATTC AG ATAAAAAG TCCTCAGGAA AGTACTGG AG 3780 
ATCCAGGAAA TAGTTCATCC GTATCTGAAG GGAAAGGAAG TTCTGAGAGA GGCAGTCCTA 3840 
TAGAAAAGTA CATGAGACCT GCGAAACACC CAAATTATTC ACCACCAGGC AGCCCTATTG 3SC0 
AAAAGTACCA GTACCCACTT TTTGGACTTC CCTTTGTACA TA ATGACTTC CAGAGTGAAG 3960 
CTGATTGGCT GCGGTTCTGG AGTAAATATA AGCTCTCCGT TCCTGGGAAT CCGC ACTACT 4020 
TG AGTCACGT GCCTGGCCTA CC AAATCCTT GCCAAAACTA TGTGCCTTAT CCCACCTTCA 4080 
ATCTGCCTCC TCATTTTTCA GCTGTTGGAT CAGACAATGA CATTCCTCTA GATTTGGCGA 4140 
TCAAGCATTC CAGACCTGGG CCAACTGCAA ACGGTGCCTC CAAGGAGAAA ACGAAGGCAC 4200 
CACCAAATGT AAAAAATGAA GGTCCCTTGA ATGTAGTAAA AACAGAGAAA GTTGATAGAA 4260 
GTACTCAAGA TGAACTTTCA ACAAAATGTG TGCACTGTGG CATTGTCTTT CTGGATGAAG 4320 
TGATGTATGC TTTGCATATG AGTTGCCATG GTGACAGTGG ACCTTTCCAG TGCAGCATAT 4380 
GCCAGCATCT TTGCACGGAC AAATATGACT TCACAACACA TATCCAGAOG GGCCTGCATA 4440 
GGAACAATGC ACAAGTGGAA AAAAATGGAA AACCTAAAGA GTAAAACCTT AGCACTTAGC 4500 
ACAATTAAAT AGAAATAGGT TTTCTTGATG GGAATTCAAT AGCTTGTAAT GTCTTATGAA 4560 
GACCTATTAA AAAAATACTT CATAGAGCCT GCCTTATCCA ACATGAAATT CCCTTCTTTT 4620 
GTTATTCTTT CTTTTGATGA GTAGGTTACC AAGATTAAAA AGTGAGATAA ATGGTCAATG 4680 
AGAAAGAATG GAAGATGGTA AACAATCACT TTTTAAA ACC TGTTAAGTCA AAACCATCTT 4740 
GGCTAATATG TACTGGGGAA ATAATCCATA AGAGATATCA CCAGACTAGA ATTAATATAT 4800 
TTATAAAGAA AGAGACCAAA ACTGTCTAGA ATTTGAAAGG GTTTACATAT TATTATACTA 4860 
AAGCAGTACT GG ACTGGCCA TTGG ACCATT TGTTCCA AAA CCC ATAAATT GTTGCCTAAA 4920 
TTTATAATGA TCATGAAACC CTAGGCAGAG GAGGAGAAAT TGAAGGTCCA GGGCAATGAA 4980 
AGAAAAATGG CGCCCTCTCA ATTTAGTCTT CTCTCATTGG CCATGTTTCA GATTTTGACC 5040 
TAG AAATGCG AGCTGTG GTT AGGCTTGGTT AG AGTGCAGC AAGC AACATG ACAGATGGTG 5100' 
GCACGCTGTT TTTACCCAGC CCTGCCTGTA CATACACATG CACACCCTCT CTGATATTTT 5160 
TGTCCTTTAG ATGTTCAAAT ACTCAGTAGT CCTTTTGTTT GCGGTTTAGA TTCATTTTGT 5220 
CCACACATGT ACCCATTTTA AAAAACAATG TCCTCG ATGC TTCTGTAGTG ATTTCATTTT 5280 
AGCCAGGTAT TTCTTTCTTG TGTGTGATGA ACCAGTATGG ATTTGCTTTT CTAAGCCTCC 5340 
TGTTGGTTAC TAATCTCACT TGGCACATTA TAACTAAAGG AATCCCCTCA ATTCAAAAGC 5400 

377 



WO 02/059377 



PCT/US02/02242 



ATAGATGG AT ACAAATGTCA GACCGTGGGT TTAATTTGTT TAGAACACAT GGCATTTCTT 5460 
C ACAAGGTAA CCTOCTGTAT TTATTTATTT TCT7TTGGTT AAATATAATT TCCAAACTTT 5520 
GTGGTCAGGC AGCGTCTAAG GTTACGTTAC CACAGACTGA CAGTTGGTAT ATGTACCAGC 5580 
CAATCCCTTC ATTAAATGTA TACAGATTTA OTTAAGTAGC ATTAAATAGG ATTCTTAGAA 5640 
5 GTATGTGCTC ATAGAACTTT TAATACTTAA GGCTTTGTAA AAACTATCCA TGAAGGGAAA 5700 
GCTCCTCAGC ATAACTGCTC AGGGAAATAG GGCTAAATAA CTGAACATTA AATAATTGGT 5760 
TAAAGGTGCT GTTAGTCGAG CCTCA ATGCT TGCTACAAGG ATGTATGTAC A AGGACTGAC 5820 
TTTAATAATT TGCATTATAT TQTCCCAACC AGTAGTTTAT TTTTTGCCAC GG AGATGTAG 5880 
AAGATATTAC AAGCTACTGG ATGCACTGTC AGATTAACTT ATTTCATTAA AGAAGTTGGG 5940 

10 AGAACAAATA GGAAAAAAAA AACTTATTTT TCTAGTAAAT ATTAATGTAT TACATTTCAA 6Q00 
ATAATGGTGC CTGACATATT GAATAATTAT TTTCTACAGT GTACGTATGC AACAAAGATA 6060 
1 f 1 (G CATTAGAG1 1 ridCCTAGCTCTTTACATlTGCAAATOTAG 6120 

CAAACAAGGT AATGAAGCAA CTATTTCTAT TGCAGTAG AT ATCCTTTTGT GTGTGTGTGT 6130 
GTGCATTAAA GTTGTAAACG GTAACATGAA ACAAATGAAA GTTCTTGCTA TAATGGTATG 6240 

1 5 GAAAACAAGA AGGAAATGAA AATATTTTTA TGCCTACTTA GGAAAAAAAG GGTAGCACTT 6300 
ATTCATTCCA AGTACTTTTT TTTTTTTAAT TTTTAAGCTC TTAACTCACA TTGTT ATGCT 6360 
TAAGATGATA AACATATATC CTCTTTTTAT TGCTTTGTCT ATGTTTCATA TGAAACATTT 6420 
CAGAAATTAT TTXGATAAGT GTTGCTGGAA TCTGCAACGC TGATTTTTTT TTGCATTCTG 6480 
TAGTCGCATT TGCACTCCAT TTTTACATTA ATTCGCAGTT GCTTTGTATC ATTGTTTTGT 6540 

20 TTGGGTTTTG TTTCTTTTTC ACAGTGCCGG GTCTTCGTTT CTTAAAGTTG GATGGCAGGT 6600 
AGAGTTCAAC CAGTTCGTGA CTGTTGTAGC GAATGAAGTT AAAAAAATGT CTTTCTGATG 6660 
TTGTGTTGTC ATTTTCATTT TTGCATTTTT TTGTTTGC AT ATTAA A A A AA GAGAA AAGAG 6720 
AAAGCAAGAG ACAGAAATCA GGACTAAGTC CTCTGCTTCA GTTTCATTGT TAACGGGCCT 6780 
TATTCTGATC TCACCTGTCG CGTAGCTCTA ATATTCACAT AAACTGAAAT AAAGAAGTGG 6840 

25 AATGAGGAGC 1TTGACATTC AAATTATGTG ATGTAATTTA TCTTCCTTAG GAATTTTGAT 6900 

GGATGCATCT CAAAATGTAT AGCCAGACTT GAG AGGTG AC AATTAA AGAT CTA AAAAAGA 6960 
GAGGAGATTC CCCCAAACAA CAATATTTAA TTTTCTTAGT AAAAAGAATA ACAGAATUCA 7020 
TCGTGGCAAT CCTTAAGCAA CATTATCTAT GTGGACTGCT TAAATCAGCA A AACACCAGA 70S0 
\GT'iTGi 71 , TTGG V 1'AT ' UG1 FTAC1 1 1 >' • \AACT TCATL'AAG 71 

3 0 CAATTTCTCT AGTGTGTCGG ACAC AAATAG GTTCTTTATT TTTGGCATGT ATGCCTTTTT 7200 
ATTTTCATTC AATTTTTTTT TTTTCTCAGA CAGACATAGT AGTATCAACT AGCATTGGAA 7260 
AATACATATC ACTATTCTTG GAATATTTAT GGTCAGTCTA CTTTTTAGTA AAATATTTTT 7320 
GGATAGCGTT GACACGATAG ATCTTATTCC ATACTTCTTT ATTATTGATA ATTTTATTTT 7380 
CATTTTTTGC TTTCATTATT ATACATATTT TGGTGGAGAA GAGGTTGGGC TTTTTTGAAA 7440 

3 5 G AGACA AAA A TTTATTATA A CACTAA ACAC TCCTTTTTTG ACATATTAAA GCCTTTATTC 7500 
CATCTCTCAA GAT ATA TT AT AAAATTTATT TTTTTAATTT AAGATTTCTG AATTATTTTA 7560 
TCTTAAATTG TGATTTTAAA CGAGCTATTA TGGTACGGAA CTTTTTTTAA TGAGGAATTT 7620 
CATGATGATT TAGGAATTTT CTCTCTTGGA AAAGGCTTCC CCTGTGATGA AAATGATGTG 7680 
CCAGCTAAAA TTGTGTGCCA TTTAAAAACT GAAAATATTT TAAAATTATT TGTCTATATT 7740 

40 CTAAATTGAG CTTTGGATCA AACTTTAGGC CAGGACCAGC TCATCCGTTC TCATTCTTCC 7800 
TTTTCTCACT CTTTCTCTCA TCACTCACCT CTGTATTCAT TCTGTTGTTT GGGATAGAAA 7860 
AATCATAAAG AGCCAACCCA TCTCAGAACG TTGTGGATTQ AGAGAGACAC TACATGACTC 7920 
CAAGTATATG AGAAAAGGAC AGAGCTCTAA TTGATAACTC TGTAGTTCAA AAGGAAAAGA 7980 
GTATGCCCAA TTCTC TCTAC ATGACATATT GAGATTTTTT TTAATCAACT '2TTAAGATAG 8040 

45 TG ATGTTCTG TTCTAAACTG TTCTGTTTTA GTGAAGG TAG ATTTTTATAA AACAAGCATG 8 1 OD 
GGGATTCTTT TCTAAGGTAA TATTAATGAG AAGGGAAAAA AGTATCTTTA ACAGCTCTTT 8160 
GTTGAAOCCT GTGGTAGCAC ATTATGTTTA TAATTGCACA TGTGCACATA ATCTATTATG 8220 
ATCCAATGCA AATACAGCTC CAAAAATATT AAATGTATAT ATATTTTAAA ATGCCTGAGG 8280 
AAATACATTT TTCTTAATAA ACTGAAG AGT CTCAGTATGG CTATTAAAAT AATTATTAGC S34Q 

5 0 CTCCTGTTGT GTGGCTGCAA AACATCACAA AGTGACCGGT CTTGAGACCT GTGAACTGCT 8^00 
GCCCTGTTTA GTAAATAAAA TTAATGCATT TCTAGAGGGG GAATATCTGC CATCCAGTGG 8460 
TGGAAATGTG GAGTAAAGAA GCTGGTGGTC TGCTTCTGTG CTGTATGCCA GCCTTTTGCC 8520 
TTAAGTTGAG AGGAGGTCAA CTTTAGCTAC TGTCTTTGGT TTGAGAGCCA TGGCAAAAAA 3580 
AAAAAAAGAA AAAAAGATCA AGTCGTCTTT GGTGAGCCAG TAAGGTGAAA GCTTGCTGAC 8640 

5 5 TGTCCAAGGC ACAAGAGAAA ATTG AGGAAT TGAAATGCA A CCTG AGTATC AAACTA AATA 8700 
TTCTAATCAA AGGTAGGTAC TGTTAG GTGG AATTCTATCA GC AGGCAACT GCAAATGAGA 8760 
AGAAGATAGA AGGACGCCCG TCGGGACTTT GGAGGGCATT GTTATTTTCC CAAAGAAAGA 8820 
CGGCCAAGGG CAGAGGCATG GATTCTTTGC AGAGCACTTC CTTTTGGTTT TTCAGTACTG 8880 
TTTCATAGAC AGTGGGCTCA CATGTTCCTG ATAGTGCTGC AGTTGCTTAG AAAGCATCCC S940 

60 AGTTAATTGC AGTAATTAGA ACTTCTGGAA TATGCTAGGG CAGAAGTATG TCAAGTATGT 9000 
CACATGAAGA AAATGTGAAA TTCAAG AGTA ATCCACACGT GAGAAACTAG ACAATGTACA 9060 
TTCATGTGTT CTCTTGAAAG GAA AGGG AGA GCTGTAAGCT 1 i iCTCTGTC rACACCCGA 120 
GAAAAGCAGG AATAACTTTA CCGTGGAAAT AATGTTTAGC TTTTATCAGA G AAAATTG TC 9S80 
CTTCTAGAGC ATAGAGTCCC A AAACTCAA7 TCTGGTTTTC CCCTGTTTTT TTTTTTTTTT 9240 
- 6 5 TTTTTCCC A A C ATATGA ACT GCAGCATATC ACTTTTTCTT TTTGTGCCTC AGGTTCCTCA 9300 
CCTGTAAAAT TGAAAAATAT ATGTATTAAT A ATATTATTA ATAATAATAA TGGTAATGTA 9360 
GTACTTGTTT GTAAAGCACT TTG AGATCCT TGGTTG A AAG GCACCATAGG AGTGCCAAGT 9420 
ATTATTATGT GGCCAAGGGG GTTATTTAAA CTGTCAGTTC CCAAAGGCCA GGAAAGGTTG 9480 
GGGTCATTTT TCTTAAAGAC GAGCTGTAAA TATCAACTAG GCAGCCAATA GTGTTGACTA 9540 

7 0 TGAAG ATGCA AA ACTATTAC TAGGCTG ATA A AATCATAGT TTCTTAATGG CTACCAATAA 9600 
GGCAAATATC ACAATAATAA ACGCCAAATT CCTTAGGGCG GACTATTTGA CAACCACATG 9660 
GAAAACTTTG GGGGAGGCAT GAGGGGGGAA CATCTCAAAA TGCCAATGTA AAATTTAACT 9720 
TACAGCAATA TTCACCAGCA GAAAATGTCT TTCATATGG A ATGATTTCAT GTTGCTAAGA 9780 
AAAAGAATTC AATTTGTAGT CCTGATTTGA ATACTAGAAT GTTGGCTATA ATAGTTCTGT 9840 

75 TCTTACAACA CATGAA ATTT TTTCGTTTTA TTTTATTTTG TTTTCATAGT GCATGTTCAT 9900 

TTCTACTCAC AAACATGTTC TTGGTGTATT TCTTATGCAA ACAATCTTCA GGCAGCAAAG 9960 
ATGTCTGTTA CATCTAAACT TGAATAATAA AGTTTTACCA CCAGTTACAC A 
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Scq ID NO: 68 Protein sequence: 
Protein Accession #: NP_054831 

5 1 11 21 31 41 51 

MPYEVNAGYD FTNMVRKKNP PLRNVASEGE GQILEPIGTE SKVSGKNKEF SADQMSENTD 60 
QSDAAELNHK EEHSLHVQDP SSSSKKDLKS AVLSEKAGFN YESPSKGGNF PSFPHDEVTD 120 
RNMLAFSFPA AGGVCEFLKS PQRAEADDPQ DMACTPSGDS LETKEDQKMS PKATEETGQA 1 80 

10 QSGQANCQGL SPVSVASK.NP QVPSDGGVRL NKSKXDLLVN DNPDPAFLSP ELQDFKCNIC 240 
UYGYYGNDPT DLIKHFRKYII LGLHNRTRQD AELDSKTLAL f INMVQFSIISK DFQKVNRSVF 300 
SGVLQDINSS RPVIXNGTYD VQVTSGGTF1 GIGRKTPDCQ GNTKYFRCKF ONITYMGNSS 360 
TELEQHFLQT HPNKJKASLP SSEVAKPSEK NSNKSIPALQ SSDSGDLGKW QDKITVKAGD 420 
DTPVGYSVPI KPLDSSRQNG TEATS YYWCK. FCSFSCESSS SLKLLEKYGK QHG AVQSGGL 480 

15 NPBLNDKLSR GSVINQNDLA KSSEGETMTK TDKSSSGAKK KDFSSKGAED NMVTSYNCQF 540 
CDFRYSKSHG PDVIVVGrLL RHYQQL.HNIH KCT1KHCPFC PRGLCSPEKH LGEITYPFAC 600 
RKSNCSHCAL LLLHLSPGAA GSSRVKHQCH QCSFTTPDVD VLLFHYESVH ESQASDVKQE 660 
ANHLQGSDGQ QSVKESKEHS CTKCDFITQV EEEISRHYRR AHSCYKCRQC EFTAADTQSL 720 
LEHFNTVHCQ EQDITTANGE EDGHAFSTIK EEPKIDFRVY NLLTPDSKMG EPVSES WKR 780 

20 EKLEEKDGLK EK.VWTESSSD DLRNVTWRGA DILRGSPSYT QASLGLLTPV SGTQEQTKTL 840 
RDSPNVEAAH LARPIYGIAV ETKGFLQGAP AGGEKSGALP QQYPASGENK SKDESQSLLR 900 
RRRGSGVFCA NCLTTKTSLW RKNANGGYVC NACGLYQKLH STFRPLNIIK QNNGEQlffiR 960 
RTRKRLNFEA LQAEQLNKQQ RGSNEEQVNG SPLERRSEDH LTESHQREIF LPSLSK.YEAQ 1020 
GSLTKSHSAQ QFVLVSQTLD IHKRMQFLHI QTKSPQESTG DPGNSSSVSE GKGSSERGSP 1080 

25 IEKYMRPAKH FNYSPPGSPI EKLYQYPLFGL PF VHNDFQSE ADWLRFWSKY KLSVPGNPHY 1140 
LSHVPGLPNP CQNYVPYPTF NLPPHFSAVG SDNDIPLDLA KHSRPGPTA NGASKEKTKA 1200 
PPNVKNEGPLNVVKTEKVDR STQDELSTKC VHCG1VFLDE VMYALHMSCH GDSGPFQCS1 1260 
CQHLCTDKYD FTTHIQRGLH rnnaqvekng kpke 

30 Seq ID NO: 69 DNA sequence 

Nucleic Acid Accession 1k XM_073879 

Coding sequence: l-387(underlincd sequences correspond to start and stop codons) 

1 11 21 31 41 51 

35 I I 1 I I I 

ATGGGGTTTG GAGACCAGGG AACGGTGGAA GGGAGCCTAG GAACGTCGAA AAAACCACCT 60 
GAAGTGAAA A TGTTTGG AGC CAGTCAAGGf TTGCTGACAA TGGA AACAAA CCAGTCCCTG 120 
GCACAAGGCA CAGGCTGCTC AGTAGTAAAG GTAGACACTG TTCTCTTCGA GAGTTTATAC 180 
CACTGCGGCT TTGAACACGG GAGCGTGATG CACTGCCTTG GGGATGATCA CCCCCAGGAA 240 

40 GACAGGAAGG CTCACTTCTC TGCCCCAGTT GCAGCCATCG CCTCTCCAGC ACCGACTCCT 300 
GTCTGTCCTG CACACCACTC AACACAGAGC ATCTGCCAGT TTCTACAGCA CTGCAGOCAG 360 
AACACTCACT TGCAGGCTGC TAACTAA 

Scq ID NO: 70 Protein sequence: 
45 Protein Accession It: X1'_073S79 

1 1! 21 31 41 51 
I I I I i I 

MGFGDQGTVE GSLGTSKKPP EVKMFGASQG LLTMETNQSL AQGTGCSVVK VDTVLFESLY 50 
50 HCGFEHGSVM HCLGDDHPQE DRKAHFSAPV AAIASPAPTP VCPAHHSTQS ICQFLQHCRQ 120 
NTHLQAAN 

Scq ID NO: 71 DNA sequence 
Nucleic Acid Accession «: AB033064 
55. Coding sequence: 826-1986 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 • 

GGGGA ACAGC AAATTCAGTC ACAG ACAATC CTCCACTCGG TCAAGAGCCA CTTTTCTCTT 60 

60 CCTGCCTTGC CCCCCCGCAG GGGGTAAGGA ACTGAGCGTT TAATCTTTAG CCG GTTGGCT 120 
ACCAUCTAAA ATl'CTACTTA TCTTAGTTTC T AGTGGATACi CT1TCTTATT TTGCCC ATGT 1 80 
TTTCTTAGAA TCCCTGTTTA ATATACTTTT GTCAGTAGTA GTATCTAGGA GTAGCAGGGA 240 
G AGTG ACAAT AAATTAGCCC CTTCTTTTTT CCCTTGTCAT TCAGGCCCCT TTTCCTCTCC 300 
AGAGGGAAAT TACCAGTAAA CTCTTCTAAA TCTTCCACCC CTTCTCAGTC ATACTGTGAA 360 

65 GAAACACACT AAAGTGGACA TTATTTG ACC AGTGAACACG AACCCAGCTT CAGGCATTGG 420 
TTTGTTGTGG CACATGGAGA AACATCTCTT TTAAAATATC TCCCAATTAC CCTTTTCACA 480 
ATTTGTATCC ACCTAGGATT TGCTGCTGGG GTAAGTCACT AGATTTATTT CTCAAAGCTC 540 
CCCTCTCTAT GAGCTGAAAG ACTGACCAAC CATGAACACT AGTAGGGGAT GGGGAAAGGG 600 
GACAGAGCAG AGCCAGTTGT TCCACACTTT GGGAAGCAGG AGTAGCTTTT ATCATCTTCC 660 

70 TCTGGGGAGC AGGCATAGAG ACATAAACTG AGTGAAAATG GGTGGAGGAA GAACTTCTAT 720 
ACCCACGAAC AACATGTGAA GAGAGAGAAC CAAACATAAA GTAAGGAGGG TAGACGTTAC 780 
ATCCAAGAGG AAATAATCCA GGCAAGGAAG CACAAGCTGA TCAAGATGTG TAGTTCTGTG 840 
GCTGCCAAGT TGTGGTTTTT GACAGATCGT CGCATCAGGG AAGACTATCC TCAAAAAGAG 900 
ATTTTACGAG CATTGAAGGC CAAATGTTGT GAGGAGGAAC TGGACTTTAG GGCTGTGGTG 960 

75 ATGGATG AGG TGGTGCTGAC AATCGAGCAA GGAAACCTGG GTCTGCGGAT CAATGG AGAG 1 020 
CTAATCACTG CCTACCCACA AGTGGTGGTA GTCAGAGTAC CAACCCCTTG GGTGCAAAGT 1080 
GATAGTGACA TCACTGTTTT GCGCCATCTA GAGAAGATGG GATGTCGGTT AATGAACCGA 1 140 
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CCTCAAGCCA TCCTG AACTG CGTTAATAAG TTCTGGACAT TTCAAGAGTT GGCTGGCCAT 1200 
GGTGTTCCTC TGCCGGATAC TTTCTCTTAT GGTGGCCACG AAAATTTTGC TAAAATGATT 1260 
GATGAGGCTG AAGTTCTGGA GTTCCCAATG GTAGTAAAGA ATACGCGGGG TCACAGAGGT 1320 
AAAGCTGTTT TCTTGGCTCG AGATAAGCAC CATTTGGCTG ATCTAAGCCA TCTTATTCGC 13S0 
5 CATGAAGCGC CATACCTGTT CCAGAAGTAT GTTAA AG AGT CTCATGGACG GGATGTACGT 1440 
GTCATTGTCG TGGGAGGCCG TGTGGTTGGC ACCATGTTAC GTTGTTCAAC AGATGGGAGA 1500 
ATGCAAAGCA ACTGCTCATT AGGTGGTGTG GGGATGATGT GCTCATTGAG TGAACAAGGG ISfiO 
AAGCAGCTAG CTATCC AGGT GTCTAATATC CTGGGGATGG ATGTGTGTGG CATTGACCTG 1 620 
CTGATGAAAG ATGACGGCTC CTTCTGCGTC TGTGAGGCCA ATGCAAATGT AGGTTTCATC 1680 

1 0 GCCTl TQATA AGGCTTGTAA TCTAGATGTA GCTGGTATCA TAGCAGACTA TGCC HC C ICC 174 
CTTCTACCCT CTGGCCGGCT CACCCGGCGT ATGTCCCTGC TCTCCGTGGT GTCCACTGCC 1 SCO 

I 1, GCCGG 1 a TGTl < 1'GAGT 1 

GCAAGTTCCA GCTCTGTTGA CAGCGACCCT GAAAGCACGG AGCGAGAGCT GCTCACCAAG 1920 
CTCCCAGGGG GCCTGTTCAA CATGAACCAG CTGCTAGCCA ATGAAATCAA ACTACTGGTG 1980 

1 5 GACTGACTCC ACTGGTAATT AACCAACAAA ACCCTTGTAA AACTTTCTTT CTTCTTTTCT 2040 

ATTTTTAAAA CCAACTTGCA ATGCTGTTCA TGGAGGATGC TCAGGAAGAT GAGAGAAAAT 2100 
TAGTAGGATT AOTTGGAGAG AGTGGG AGAT AGATG AGACC TCTGCTAGTA AGATGTTACT 2160 
TTCATTTACA AATCCTACAA ATAGAGAGGC AGAATAGGTG GGGTATAGAA AAATGTCAGG 2220 
CTCTCATAGT TACCCTTTTA AATTGCTAAA AAATGTGTAT GCTCATAGGC CATGAGGAAC 2280 

20 AAATACTTTT TTTTTTtCAT GGTCCCTTGC TTITGTTTIT GTACAAAAAA AAATGGTTTT 2340 

GCTACAAATA TCCAAGTAGC ATAACTTCAC ATTGTGTTGG AAGATTTGTC ATCAGTGAGG 24O0 
AAAACATCTG CTTAAATTAC AGGAATTTTT GTATTATACA GCTCTGAAAA TTCTGCCATT 2460 
TCCTTATTAA CTAGCAGCTT TAGTTTGTAG TTTATGAAAT CTTGAGGGGC TCTTTTACTG 2520 
GG ATTTCTTA TTTTTTTGTT TTTTCCCGCT TAATTTGGTG GGAGGTCAAA TTG AATATAA 2580 

25 CCCAATAAAG GCTTCTTAAT GACAAAATTG GCATGTTTGC ATGATGAAAT GGAAATGAAC 2640 
AGTATTGCAA TGTCCGGTAT ACAAAATAAC ATTAATTCAA TGT AGAT AAA ATTACACTAG 2700 
TTTA A AATAT GTOCATTCAC TTGTATTTGT TAUTGTrTTA GTCTTTTTTG A \GATt «< 2 i 
TCTGTTAATG TTGCTTTTTT 'il l I TrnTl'TAATACATGC TAGTCTAACA riT<XTGCTC 2820 
TATGCCFOCA-lCn-fAACAA TGGCCAAAGT GAAGAAAATG CT, 3TT1TT TGT'l " „ in 

3 0 ACACTGACTT G AAACATGTA CATTTA AAGC CTTTTATTTT TTCCCTTTTT GTTTTGGTAG 2940 

TTGGGCATTT AAATAAGG AC AAGG AAAAAT ATTTTTGGGG GCAAATCAAG AGCCTATGAG 3000 
TTCTAAGTAT AAAGCTGAAG TG ATTTCGA A TGCCAGCGTT ATATATTTGC ATTTTTCACA 3060 
TTTTACGAGG GAGTATATGT GTATGTGTGT GCACGCATGC ATGTGTATGT GTTTTGCTTT 3120 
TTGTTTCCAT CAACTAATCA AAAAGGATA A TTTAGA AA AT GGAGCATG AT GGGAAACAGA 3 1 SO 

3 5 GTTTTTGACT TTAAAAAACA GATGAGTTGT TTTCATAAGT AGACTCCACT GGGGTAGAGG 3240 
TATTCACCTT AAAACATAGG GTGAGTAGAT GCTTTTTTAG GCCTTTTTGT GTATATGTAC 3300 
GTTGTTTGTT TTTTTCCTTT TGTTTCTAGC CTGTTCAGTG TACAGTTTAT TCAAGGCTAC 3360 
ATGCTTTTCT TTAATGCTTC TGGCTATGCA TTTTCTCTTT TTACATATAG GATTTGQGAT 3420 
TGGGGGTGGG TTGGATGTTT TTGTTTGGGG ACTTATTTAG TAGTATTGAG TCTCTTATAG 3480 

40 CCCTACTCTT AAGCCTTCAA TACTGTCCAC TCTTTATATT CCTTTACTTG CAGAATTTAT 3540 

AAAAGCCCCC AAACTGCATA TAATATGAGC CTTTAAAACA TGGGTAAAAC TAATCCCATT 3600 
GATGGGTTTG GATGGTATGT TAAGAAATGG AGATGCTGCA GAGCCCAACG TAATTTTTTA 3660 
A ACAGCAAGT TTTCCATCTC CCTACGAATC CTCTGA AGCT TTTACCCAAG CCCTTTCTTG 3720 
CCTCTCCAGT GCTATTTTCC TTCAGATGUA CCTTAAACAT AATTTCTTGG ACACTACTAG 3780 

45 AGAGACTTCG AGGCAATAAT AAAACATCAG TATTAACCAG CTATAACAGA GGVJTGATCA 3840 
TGCTTACTTG TACAGTTTTT CCCCCGTTTT AAA A AGO A AT G'f AATAAAAT TTGTTTTTTC 3900 
CATAGAATTA AATAATATTA AAATTGAGTG AAAGGTTGAT TGTTGATGAA TAGAATAGTA 3960 
CCTCTCATCT GTGCAGTGTC TCATTTCACC TCAGAGAAAA GGATACATAA GAGGAGTTTG 4020 
TAATTTATCT TAGGATATTC TAATTGCATT TAAAAGAACT TATCTTGCGC AGGGTAAATG 40S0 

50 GGGGACTCAC ATACATATAT TAATACCTCT GACTCATTAA CAGAAAGAAA TACTTGGTAC 4140 
TTCTTTCGCT GAATG ACCAT ACTGTGG AGG ATGCATACTA TTTGGTATAG AGAAATAAAT 4200 
GAGGAAGAAA G AACTG CTTA ATTAAATTAT CATTCATATG TTCATATAGA GACCATCTGG 4260 
TTGCCATGTG TATTATGACA CATACACTTT GAATAGTTAC ATATCACAAG TATGTAGTTC 4320 
ATGTTTGTGT TGGTGGGGTA AGGCATCAGG AAAAATGTAG TTAGTCTTTT CTTAACTTAT 4380 

5 5 ACCAAATTAA CCA ACTATAT TATAGGAAAT ATGTGAAATT AGTTCATTAG CTTTATTCAC 4440 
TATTATGCAT TCACATGATA TTAAAACGTA CACTCACATG TTAGAATGAA AAGAGCAGTA 4500 
GTTATCTTAG ATTTTAAAAA CATGGATATC TTCTTGAATT CCTTCAAGAT TGAGGTAGAG 4560 
AATAAGAGCA AATCATTCTG GAAGTACCTT AAGGAAACAA ACAGCAGCAG ATATTTAGGT 4620 
TAAACTTATT TTCATAATTG TTTAATAACT TTTGTATAAT CTTCATTGCT ATTATGAGAG 4680 

60 AGAATGTATA TATCAAATAT GTGTAATGAT AAAATCTGAA TTGTAAAATT TI'I GTATATT 4740 
GTTAAAATTG TAATTCTAAA TTGTATTTCA AAAATGATTA TTTCI'G ATAT TGT'TTITATG 4800 
TCACCCATGA TGAAAACTGG ACTTTATATA TCTAAACAT; CJ (i TATCS rATTCTA I 
TAAAATTTTT AATAGTTTTT TTCTTTTTTG GTGCCTATAA TTGATTGGTC ATTTCTGCTG 4920 
GCTTTTCTCC AATGAACATT GAAATCTTCC TGTATATGTT ACCAATAAGA AAACTACCCT 4980 

65 GGAACAGTAG AAAAACCCAA CAAGAGACTT GGCATTCATC AAGCACATTA TCAGACTTTG 5040 
AGAACATATT GAAGGCATTG ACTTTGAAAA TCATCTCTTT TTCTCA AG AA GAAAGCAATG 5100 
GAGAAGCAAA TTTGTTTCAT TCAGTGAATC CCCAGTTTGG GGCTTGTGGG GCTTAGAGAC 5160 
ATTGTGA AAT CAAATCTTGT GTTATACTTT TCTCCTGGCT CACTTTTTTT GAG AAGGTTT 5220 
ATGGGCTATT TG GCTGGTGA GAC ACGATCC CCTCCTAAGA AA ATGTAGGT GCTCAGACAG 5280 

7 0 GTAACCACTG CTGCTACTGT TTTTATTTGT TTGTTTGTTC AATTTTATTT AAGATTTGTT 5340 
TTTGTTGTAC TAGGATTTTA AAAAATGTAA TATATTGCAG GATTTATAAC CAG 

Seq ID NO: 72 Protein sequence: 
Protein Accession #: BAA86552 



1 11 21 31 41 51 
I I I I 1 I 
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MCSSVAAKXW FLTDRRIRED YPQKE1LRAL KAKCCEEELD FRAVVMDEVV LT1EQGNLGL 60 
WNGEUTAV FQVVVVRVPT PWVQSDSDIT VLRHLEKMGC RLMNRPQAIL NCVNKFWTFQ 120 
ELAGHGVPLP DTFSYGGHEN FAKMIDEAEV LEFPMVVKNT RGKRGKAVFL ARDKHHLADL- 180 
SHLIRHEAPY LFQKYVKESH GRDVRVIVVG GRVVGTMLRC STDGRMQSNC SEGGVGMMCS 240 
LSEQGKQLAI QVSNILGMDV CGIDLLMKDD GSFCVCEANA NVGFIAFDKA CNLDVAGHA 300 
DYAASLLPSG RLTRRMSLLS VVSTASETSE PELGPPASTA VDNMSASSSS VDSDPESTER 360 
ELLTKLPGGL FNMNQLLANE IKLLVD 

Scq SD NO: 73 DNA sequence 

Nucleic Acid Accession ft XMJM0080.2 

Coding sequence: 159-1 104 (underlined sequences correspond to start and stop codons) 
I 11 21 31 41 51 

CTGAGTGGGG GCGGGGACTG CTGGAGTTGCGGGGCCTGCCTGGGGTAGGQ CGGGGCAGGA 60 
CAGCTTGGAG ATAGGGCCCG GAATTGCGGG CGTCACTCTG CTCCTCCGAC CTAGCCAGGC 120 
GTGAGGG AGT GACAGCAGCG CATTCGCGGG ACGAGAGCGATGAGTGAGAA CGCCGCACCA 1 80 
GGTCTGATCT CAGAGCTGAA GCTGGCTGTG CCCTGGGGCC ACATCGCAGC CAAAGCCTGG 240 
GGCTCCCTGC AGGGCCCTCC AGTTCTCTOC CTGCACGGCT GGCTGGACAA TGCCAGCTCC 300 
TTCGACAGAC TCATCCCTCT TCTCCCGCAA GACTTTTATT ACGTTGCCAT GG ATTTCGG A 360 
GGTCATGGGC TCTCGTCCCA TTACAGCCCA GGTGTCCCAT ATTACCTCCA GACTTTTGTG 420 
AGTGAGATCC GAAGAGTTGT GGCAGCCTTG AAATGGAATC GATTCTCCAT TCTGGGCCAC 480 
AGCTTCGGTG GCGTCGTGGG CGGAATGTTT TTCTGTACCT TCCCCGAG AT GGTGGATAAA S40 
CTTATCTTGC TGGACACGCC GCTCTTTCTC CTG GAATCAG ATG AAATGG A G AACTTGCTG 600 
ACCTACAAGC GGAGAGCCAT AGAGCACGTG CTGCAGGTAG AGGCCTCCCA GGAGCCCTCG 660 
CAGGTGTTCA GCCTGAAGCA GCTGCTGCAG AGGTTACTGA AGAGCAATAG CCACTTGAGT 720 
(JACiTCi G V 1 r 1GC \AAGA GGAACCACGA AGGTGGCCAC AGOTCTGGTT 780 
CTGAACAGAG ACCAGAGGCP G< 1/ iCA < iAGAACAGCA TTGACTTCAT CAGCAGGGAG 840 
C i GTG( 3C AT f" VTCAG GAAGCTGCAG GCCCATGTCC TGTTGATCAA AG ;A(5TCC V S00 
GGATATTTTG ATTCAAGACA GAATTACTCT GAGAAGGAGT CCCTGTCGTT CATGATAGAC 960 
ACGATG AAAT CCACCCTCAA AGAGCAGTTC CAGTTTGTGG AAGTCCCAGG CAATCACTGT 1 020 
GTCCACATGA GCGAACCCCA GCACGTGGCC AGTATCATCA GCTCCTTCTT ACAGTGCACA 1080 
CACATGCTCC CAGCCCAGCT GTAGCTCTGG GCCTGGAACT ATGAAGACCT AGTGCTCCCA 1 140 
GACTCAACAC TGGGACTCTG AGTTCCTGAG CCCCACAACA AGGCCAOGGA TGGTGGGGAC 1200 
AGGCCTCACT AGTCTTGAGG CCCAGCCTAG GATGGTAGTC AGGGGAAGGA GCGAGATTCC 1260 
AACTTCAACA TCTGTG ACCT CAAGGGGGAG ACAGAGTCTG GGTTCCAGGG CTGCTTTCTC 1320 
CTGGCTAATA ATAAATATCC AGCCAGCTGG AGGAAGGAAG GGCAGGCTGG GCCCACCTAG 1380 
CCTTTCCCTG CTGCCCAACT GGATGGAAAA TAAAAGGTTC TTGTATTCTC A 

Seq ID NO: 74 Protein sequence: 
Protein Accession ft XP_O4008O.l 

! 11 21 31 41 51 

I I I I ! I 

MSENAAPGU SELKLAVPWG HIAAKAWGSL QCPPVLCI ,1 1(3 VVLDNASSFDR L1PLLPQDFY 60 
YVAMDI-'GGIIG LSSHYSPGVP YYLQTfVSHl RRVVAAl.KWN RFSILGHSFG GVVGGMFFCT 120 
FPEMVDKL1L LDTPLFLLES DEMENLLTYK RRAIEHVLQV EASQEPSHVF SLKQLLQRLL 1 80 
KSNSHLSEEC GELLLQRGTT KVATGLVLNR DQRLAWAENS 1DF1SRELCA HSIRKLQAHV 240 
LLKAVHGYF DSRQNYSEKE SLSFMIDTMK STLKEQFQFV EVPGNHCVHM SEPQHVASII 300 
SSFLQCTHML PAQL 

Seq ID NO: 75 DNA sequence 

Nucleic Acid Accession #: NM.005794 

Coding sequence: 434-1 276 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

GGTTCCCTTC CACGCTGTGA AGCTTTGTTC TTTTGGTCTT CATG ATAAAT CTTGCTGCTG 60 
CTCACtCGTT GGGTCCGTGC CACCTTTAAG AGCTGTAACA CTCACCGCGA AGGTCTGCAA 120 
CTTCACTCCT GGGGCCAGCA AGACCACGAA TGCACCG AGA GGAATGAACA ACTCTGG ACA 1 80 
CACCATCTTT AAGAACCGTA ATACTCACCG CAAfiGGTCTG CAACTTCATT CTTGAAG'l'CA 240 
OTGAGGCCAA GAACCCATCA ATTCCGTACA CATlTTGuTG ACTTTGAAGA GACTGTOACC 300 
TATCACCAAG TGGTGAGACT ATTGCCAAGC AGTGAGACTA TTGCCAAGTG GTG AG ACC AT 360 
CACCAAGCGG TGAGACTATC ACCTATCGCC AAGTGGCCTG ATTCAGCAGG AAGCATCTCA 420 
GACACCAACC ACTATOCTGT CAGCAGTTGC CCGGGGCTAC CAGGGCTGGT 7TCATCCCTG 480 
TG CTAGGCTT TCTGTGAGG A TG AGCAGCAC CGGGATAG AC AGGAAGGGCQ TCCTGGCTAA 540 
CCGGGTAGCC GTGGTCACGG GGTCCACCAG TGGGATCGGC TTTGCCATCG CCCGACGTCT 600 
GGCCCGGGAC GGGGCCCACG TGGTCATCAG CAOCCCGAAG CAGCAGAACG TGGACCGGGC 660 
CATGGCCAAG CTGCAGGGGG AGGGGCTGAG TGTGGCGGGC ATTGTGTGCC ACGTGGGGAA 720 
GGCTGAGGAC CGGGAGCAGC TGGTGGCCAA GGCCCTGGAG CACTGTGGGG GCGTCGACTT 780 
CCTGGTGTGC AGCGC AGGGG TCAACCCTCT GGTAGGG AGC ACTCTGGGGA CCAGTGAGCA 840 
GATCTGGGAC AAGATCCTAA GTGTGAACGT G AAGTCCCCA GCCCTOCTGC TGAGCCAGTT 900 
GCTGCCCTAC ATGGAGAACA GGAGGGGTGC TGTCATCCTG GTCTCTTCCA TTGCAGCTTA 960 
TAATCCAGTA GTGGCGCT.GG GTGTCTACAA TGTCAGCAAG ACAGCGCTGC TGGGTCTCAC 1020 
TAGAACACTG GCATTGGAGC TGGCCCCCAA GGACATCCGG GTAAACTGCG TGGTTCCAGG 1080 
A ATTATA AAA ACTG ACTTCA GCAAAGTGTT TCATGGGAAT GAGTCTCTCT GGAAGAACTT 1140 
CAAGGAACAT CATCAGCTGC AGAGGATTGG GG AGTCAGAG GACTGTGCAG GAATCGTGTC 1200 
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CTTCCTGTGC TCTCCAGATG CCAGCTACGT CAACGGGGAG AACATTGCGG TGGCAGGCTA 1260 
CTCCACTCGO CTCTGAGAGG AGTGGGGGCG GCTGCGTAGC TGTGGTCCCA GCCCAGGAGC 1320 
CTGAGGGGGT GTCTAGGTGA TCATTTGGAT CTGGAGCAGA GTCTGCCATT CTGCCAGACT ! 380 
AGCAATTTGG GGGCTTACTC ATGCTAGGCT TGAGGAAGAA GAAAAACGCT TCGGCATTCT 1440 
5 CC 

Seq ID NO: 76 Protein sequence: 
Protein Accession ik NP.O0S785 

10 1 II 21 31 41 51 

I ] I 1 ! I 

MLSAVARGYQ GWFHPCARLS VRMSSTG IDR KG VLANRVAV VTGSTSGIGF AIARRLARDG 60 
AHVVISSRKQ QNVDRAMAKL QGEGLSVAGI VCHVGKAEDR EQLVAKALEH CGGVDFLVCS 120 
AG VNPLVGST LGTSEQIWDK ILSVNVKSP A LLLSQLLPYM ENERGA VILV SS1AAYNPV V 1 SO 
15 ALGVYNVSKT ALLGLTRTLA LELAPKDIRV NCVVPGIIKT DFSKVFHGNE SLWKNFKEHH 240 
QLQRIGESED CAG1VSFLCS PDASYVNGEN IAVAGYSTRL 



Seq ID NO: 77 DNA sequence 
20 Nucleic Acid Accession ft NM_002425 

Coding sequence: 26-1 453 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

25 AAAGAAGGTA AGGGCAGTGA GAATGATGCA TCTTGCATTC CTTGTGCTGT TGTGTCTGCC 60 
AGTCTGCTCT GCCTATCCTC TGAGTGGGGC AGCAAAAGAG GAGGACTCCA ACAAGGATCT 120 
TGCCCAGCAA TACCTAGA A A AGTACTAC AA CCTCG AAAAG G ATGTGAAAC AGTTTAGAAG 1 SO 
AAAGGACAGT AATCTCATTG TTAAAAAAAT CCAAGGAATG CAGAACTTCC 'ITGHGTTGGA 240 
j M (IS'' I T i I CGCAAGC II 1 

3 0 TCCTG ACGTT GGTC ACTTC A G CTCCTTTCC TGGCATGCCG A AGTGG AGGA AAACCCACCT 360 
TACATACAGG ATTGTGAATT ATACACCAGA TTTGCCAAGA GATGCTGTTG ATTCTGCCAT 420 
TGAGAAAGCT CTGAAAGTCT GGGAAGAGGT GACTCCACTC ACATTCTCCA GGCTGTATGA 480 
AGGAGAGGCT GATATAATGA TCTCTTTCGC AGTTAAAGAA CATGGAGACT TTTACTCTTT 540 
TGATGGCCCA GGACACAGTT TGGCTCATGC CTACCCACCT GGACCTGGGC TTTATGGAGA 600 

35 TATTCACTTT GATGATGATG AAAAATGGAC AGAAGATGCA TCAGGCACCA ATTTATTCCT 660 
CGTTGCTGCT CATGAACTTG GCCACTCCCT GGGGCTCTTT CACTCAGCCA ACACTG AAGC 720 
TTTGATGTAC CCACTCTACA ACTCATTCAC AG AGCTCGCC CAGTTCCGCC TTTCGCAAGA 780 
TGATGTGAAT GGCATTCAGT CTCTCTACGG ACCTCCCCCT GCCTCTACTG AGGAACCCCT 840 
GGTGCCCACA AAATCTGTTC CTTCGGGATC TGAGATGCCA GCCAAGTGTG ATCCTGCTTT 900 

40 GTCCTTCGAT GCCATCAGCA CTCTGAGGGG AG A ATATCTG TTCTTTAAAG ACAGATATTT 960 
TTGGCGAAGA TCCCACTGGA ACCCTGAACC TGAATTTCAT TTGATTTCTG CATTTTGGCC 1020 
CTCTCTTCCA TCATATTTGG ATGCTGCATA TGAAGTTAAC AGCAGGGACA CCGTTTTTAT 1080 
TTTTAAAGGA AATGAGTTCT GGGCCATCAG AGGAAATGAG GTACAAGCAG GTTATCCAAG 1140 
AGGCATCCAT ACCCTGGGTT TTCCTCCAAC CATAAGGAAA ATTGATGCAG CTGTTTCTGA 1200 

45 CAAGGAAAAG AAGAAA ACAT ACTTCTTTOC AGCGGACAAA TAC1GGAGAT TTGATGAAAA 1260 
TAGCCAGTCC ATGGAGCAAG GCTTCCCTAG ACTAATAGCT GATGACTTTC CAGGAGTTGA 1320 
GCCTA AGGTT GATGCTGTAT TACAGGCATT TGGATTTTTC TACTTCTTCA GTGGATCATC 1 380 
ACAGTTTGAG TTTGACCCCA ATGCCAGGAT GGTGACACAC ATATTAAAGA GTAACAGCTG 1440 
GTTACATTGC TAG GCGAGAT AGGGGGAAGA CAGATATGGG TGTTTTTAAT AAATCTAATA 1500 

50 ATTATTCATC TAATGTATTA TGAGCCAAAA TGGTTAATTTTTCCTGCATG TTCTGTGACT 1560 
GAAGAAGATG AGCCTTGCAG ATATCTG CAT GTGTCATG AA GAATGTTTCT GGAATTCTTC 1 620 
ACTTGCTTTT GAATTGCACT GAACAGAATT AAGAAATACT CATGTGCAAT AGGTGAGAGA 1680 
ATGTATTTTC ATAGATGTGT TATTACTTCC TCAATAAAAA GTTTTATTTT GGGCCTGTTC 1740 

55 m 

Seq ID NO: 78 Protein seanence: 



75 



60 i i ] i i i 

MHLAFLVLLC LPVCSAYPLS GAAK'BEDSNK DLAQQYLEKY YNLEKDVK >\ J' iJ jN VI 60 
K1QGMQKFLG LEVTGKLDTD 1 LliVMRKPRC GVPDVGHFSE IT j '1 1 VH [ ILTYRWNYT 120 
PDLPRDAVDS AIEKADCVWE EVTPLTFSRX YEGEADIMIS FAVKK1IGDFY SFDGPGHSLA 180 
HAYPPGPGLY GDIHFDDDEK WTEDASGTNL FLVAAHELGH SLGLFHSANT EALMYPLYNS 240 
6 5 FTELAQFRXS QDDVNGIQSL YGPFPASTEE PLVPTKSVPS GSEMPAKCDP ALSFDAISTL 300 

RGEYLFFKDR YFWRRSHWNP EPEFHLISAF WFSLPSYLDA AYEVNSRDTV FIFK.GNEFWA 3 60 
IRGNEVQAGY PRGEHTLGFP PTIRKIDAAV SDKEKKKTYF FAADKYWRFD ENSQSMEQGF 420 
PRLIADDFPG VEPKVDAVLQ AFGFFYFFSG SSQFEFDPNA RMVTKILKSN SWLHC 

70 Seq ID NO: 79 DNA sequence 

Nucleic Acid Accession ft NM_000493.1 

Coding sequence: 1-2043 (underlined sequences correspond to start and stop codons) 



ATGCTGCCAC AAATACCCTT TTTGCTGCTA GTATCCTTG A ACTTGGTTCA TGG AGTGTTT 60 
TACGCTG A AC G ATACCA AAC GCCCACAGGC ATA AAAGGCC CACTACCC AA CACCAAGACA 1 20 
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CAGTTCTTCA TTCCCTACAC CATAAAGAGT AAAGGTATAG CAGTA AGAGG AGAGCAAGGT 1 80 
ACTCCTGGTC CACCAGGCCC TGCTGG ACCT CGAGGGCACC CAGGTCCTTC TGGACCACCA 240 
GGAAAACCAG GCTACGGAAG TCCTOGACTC CAAGGAGAGC CAGGGTTGCC AGGACCACCG 300 
GGACCATCAG CTGTAGGGAA ACCAGGTGTG CCAGGACTCC CAGGAAAACC AGGAGAGAGA 360 
5 GGACCATATG GACCAAAAGG AGATGTTGGA CCAGCTGGCC TACCAGGACC CCGGGGCCCA 420 
CCAGGACCAC CTGGAATCCC TGGACCGGCT GGAATTTCTG TGCCAGGAAA ACCTGGACAA 480 
CAGGGACCCA CAGGAGCCCC AGGACCCAGG GGCTTTCCTG GAGAAAAGGG TGCACCAGGA 540 
GTCCCTGGTA TGAATGGACA GAAAGGGGAA ATGGGATATG GTGCTCCTGG TCGTCCAGGT 600 
GAGAGGGGTC TTCCAGOCCC TCAGGGTCCC ACAGGACCAT CTGGCCCTCC TGGAGTGGGA 660 

1 0 AAAAG AGGTG A AAATGGGGT TCCAGGACAG CCAGGCATCA AAGGTOATAG AGGTTTTCCG 720 
GGAGAAATGG GACCAATTGG CCCACCAGGT CCCA \GGO • 1 1 HjIljh -\CGAGGGCCA 780 
GAAGGCATTG GAAAGCCAGG AGCTGCTGG A GCCCCAGGCC AGCCAGGGAT TCCAGGAACA 840 
AAAGGTCTCC CTGGGGCTCC AOGAATAGCT GGGCCCCCAG GGCCTCCTGG CTTTGGQAAA 900 
CCAGGCTTGC CAGGCCTGAA GGGAGAAAGA GGACCTGCTG GCCTTCCTGG GGGTCCAGGT 960 

1 5 GCCAAAGGGG AACAAGGGCC AGCAGGTCTT CCTGGGAAGC CAGGTCTGAC TGGACCCCCT 1020 
GGGAATATGG'gACCCCAAGG ACCAAAAGGC ATCCCGGGTA GCCATGGTCT CCCAGGCCCT 1080 
AAAGGTGAGA CAGGGCCAGC TGGGCCTGCA GGATACCCTG GGGCTAAGGG TGAAAGGGGT 1140 
TCCCCTGGGT CAGATGGAAA ACCAGGGTAC CCAGGAAAAC CAGQTCTCGA TGGTCCTAAG 1200 
GGTAACCCAG GGTTACCAGG TCCAAAAGGT GATCCTGGAG TTGGAGGACC TGCTGGTCTC 1260 

20 CCAGGCCCTG TGGGCCCAGC AGGAGCAAAG GGAATGCCCG GACACAATGG AGAGGCTGGC 1320 
CCAAGAGGTG CCCCTGGAAT ACCAGGTACT AGAGGCCCTA TTGGGCCACC AGGCATTCCA 1380 
GGATTCCCTG GGTCTAAAGG GGATCCAGGA AGTCCCGGTC CTCCTGGCCC AGCTGGCATA 1440 
GCAACTAAGG GCCTCAATGG ACCCACCGGG CCACCAGGGC CTCCAGGTCC AAGAGGCCCC 1500 
TCTGGAGAGC CTGGTCTTCC AGGGCCCCCT GGGCCTCCAG GCCCACCAGG TCAAGCAGTC 1560 

25 ATGCCTG AGG GTTTTATAAA GGCAGGCCAA AGGCCCAGTC TTTCTGGGAC CCCTCTTGTT 1 620 
AGTGCCA ACC AGGGGGTAAC AGGAATGCCT GTGTCTGCTT TTACTGTTAT TCTCTCCAAA 1 680 
GCTTACCCAG CAATAGGAAC TCCCATACCA TTTGATAAAA T1TTGTATAA CAGGCAACAG 1740 
CATTATGACC CAAGGACTGQ AATCTTTACT TGTCAGATAC CAGGAATATA CTATTTTTCA ! 800 
TACCACGTGC ATGTGAAAGG GACTCATGTT TGGGTAGGCC TGTATAAGAA TGGCACCCCT 1 SCO 

3 0 GTA ATGTACA CCTATGATG A ATACACCAAA GGCTACCTGG ATCAGGCTTC AOGG AGTGCC 1 920 
ATCATCGATC TCACAGAAAA TGACCAGGTG TGGCTCCAGC TTCCCAATGC CGAGTCAAAT 1980 
GGCCTATACT CCTCTGAGTA TGTCCACTCC TCTTTCTCAG GATTCCTAGT GGCTCCAATG 2040 
TGAGTACACC CCACAGAGCT AATCTAAATC TTGTGCTAGA AAAAGCATTC TCTAACTCTA 2100 
CCCCACCCTA CAAAATGCAT ATGGAGGTAG GCTGAAAAGA ATGTAATTTT TATTTTCTGA 2160 

35 AATACAGATT TGAGCTATCA GACCAACAAA CCTTCCCCCT GAAAAGTGAG CAGCAACGTA 2220 
AAAACGTATG TGAAGCGTCT CTTGAATTTC TAGTTAGCAA TCTTAAGGCT CTTTAAGOTT 2280 
TTCTCCAATA TTAAAAAATA TCACCAAAGA AGTCCTGCTA TGTTAAAAAC AAACAACAAA 2340 
AAACAAAGCA ACAAAAAAAA AAATTAAAAA A AAAAACAGA AATAGAGCTC TAAGTTATGT 2400 
GAAATTTGAT TTGAGAAACT CGGCATTtCC TTTTTAAAAA AGCCTGTTTC TAACTATGAA 2450 

40 TATGAGAACT TCTAGGAAAC ATCCAGGAGG TATCATATAA CTTTGTAGAA CTTAAATACT 2520 
TGAATATTCA AATTTAAAAG ACACTGTATC CCCTAAAATA TTTCTGATGG TGCACTACTC 2580 
TGAGGCCTGT ATGGCCCCTT TCATCAATAT CTATTCAAAT ATACAGGTGC ATATATACTT 2640 
GTTA AAGCTC TTATATAAAA AAGCCCCAAA ATATTGAAGT TGATCTGAAA TGCAAGGTGC 2700 
m tCA/ iCC'ITl AAACF TTTC 1 170 / VAGCTTT TTATATACO 60 

45 ACJCATAACTT GGAAACAGGT ATCTGACCTA TTC ITAT1TA GTTAACACAA GTGTGATTAA 2820 
TTTGATTTCT TTAATTCCTT ATTGAATCTT ATGTGATATG ATTTTCTGGA TTTACAGAAC 2880 
ATTAGCACAT GTACCTTGTG CCTCCCATTC AAGTGAAGTT ATAATTTACA CTGAGGGTTT 2940 
CAAAATTCGA CTAGAAGTGG AGATATATTA TTTATTTATG CACTGTACTG TAmTTATA 3000 
TTGCTGTTTA AAACTTTTAA GCTGTGCCTC ACTTATTAAA GCACAAAATG TTTTACCTAC 3060 

5 0 TCCTTATTTA CGACACAATA A AATAACATC A ATAGATTTT TAGGCTGAAT TAATTTGAA A 3 1 20 
GCAGC AATTT GCTGTTCTCA ACCATTCTTT C AAGGCTTTT CATTCGACAC A ATA AAATAA 3 180 
CATCAATAGA TTTTTAGG 

Seq ID NO: 80 Protein sequence: 
55 Protein Accession ik NP_000484.I 



1 11 21 31 41 51 

6 0 MLPQffFLLL VSLNLVHGVF YAERYQTPTG IKGPLPNTKT QPFIPYTOiS KGIAVRGEQG 60 

TPGPPGPAGP BGHPGPSGPP GKPGYGSPGL QGEPGLPGPP GPSAVGKPGV PGLPGKPGER 120 
Gl'YGPKGDVG PAGLPGPRGPPGPPGJPGPA G1SVPGKPCQ QGPTC.APGPR GPPGP.KGAPG 180 
VPGMNGQKGE MGYGAI'GRPG ERGLPGPQGP TGPSGPPGVG KRGENGVPGQ PGKGDRGFP 240 
GEMGPIGPPG PQGPPGERGP EGIGKPGAAG APGQPGZPGT KGLPGAPGIA GPPGPPGFGK 300 

65 PGLPGLKGER GPAGLPGGPG AKGEQGPAGL PGKPGLTGPP GNMGPQGPKG IPGSHGLPGP 360 
KGETGPAGPA GYPGAKGERG SPGSDGKPGY FOKPGLDGPK. GNPGLPGPKG DPGVGGPPGL 420 
PGPVGPAGAK GMPGHNGEAG PRGAPG1PGT RGPIGPPGIP GFPGSKGDPG SPGPPGPAGI 480 
ATKGLNGPTG PPGPPGPRGP SGEPGLPGPP GPPGPPGQAV MPEGFOCAGQ KPSLSGTPLV 540 
SANQGVTGMP VSAFTV1LSK AYPA1GTPIP FDKILYNRQQ HYDPRTGIFT CQIPG1YYFS 600 

70 YHVHVKGTHV WVGLYKNGTP VMYTYDEYTK G YLDQASGSA IIDLTENDQV WLQLPNAESN 660 
GLYSSEYVHS SFSGFLVAPM 

Seq ID NO: 8 1 DNA sequence 
Nucleic Acid Accessions: NM_00078fi 
75 Coding sequence; 332-1861 (underlined sequences correspond lo start and stop codons) 
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CGCGATTCTC AGOOATTGAT CCGCCTCTTC AGGTAAGTTA TCTTCCGGCC CCGTACCACT SO 
QTGCCACAGG CGCAGCCCGC TTCCTCAGGT GCCCTATCCC GCGCAGAAGA CCACGGCTTC 120 
ACAGAGTGTT ATTTAAGGGC GTGGCCAGCG GAACATCCCG CCCCATTCTG TGACGCACGG 1 BO 
5 GGTGGCGCGC GTGGGACCCG AGGGGTGGGG CTGGGTTTAG TAGGAGACCT GGGGCAAGGC 240 
CCCCTGTGGA CGACCATCTG CCAGCTTCTC TCGTTCCGTC GATTGGGAGG AGCGGTGGCG 300 
ACCTCGGCCT TCAGTGTTTC CGACGGAGTG AATGGCGGCG QCGGCTGGGA TGCTGCTGCT 360 
GGGCTTGCTG CAGGCGGGTG GGTCGGTGCT GGGCCAGGCG ATGGAGAAGG TGACAGGCGQ 420 
CAACCTCTTG TCCATGCTGC TGATCGCCTG CGCCTTCACC CTCAGCCTGG TCTACCTGAT 480 
1 0 CCGTCTGGCC GCCGGCCACC TGGTCCAGCT GCCCGCAGGG GTGAAAAGTC CTCCATACAT 540 
Tl TCTCCCCA A1TCCATTCC TTGGGCATGC CATAGCATTT GGGAAAAGTC CAA'iTGAATT GOO 
TCTAGAAAAT GCATATGAGA AGTATGGACC TGTATTTAGT TTTACCATOCi TAGGCAAGAC 660 
ATTTACTTAC CTTCTGGGGA GTGATGCTGC TGCACTGCTT TTTAATAGTA AAAATGAAGA 720 
CCTGAATGCA GAAGATGTCT ACAGTCGCCT GACAACACCT GTGTTTGGGA AGGGAGTTGC 780 

1 5 ATACG ATGTG CCTAATCCAG TTTTCTTGO A GCAGAAGAAA ATGTTAAAA A GTGGCCTTA A B40 
CATAGCCCAC TTTAAACAGC ATGTTTCTAT AATTGAAAAA GAAACAAAGG AATACTTTGA 900 
GAGTTGGGGA GAAAGTGGAG AAAAAAATGT GTTTGAAGCT CTTTCTGAGC TCATAATTTT 960 
AACAGCTAGC CATTGTTTGC ATGGAAAGGA AATCAGAAGT CAACTCAATG AAAAGGTAGC 1020 
ACAGCTGTAT GCAGATTTGG ATGGAGGTTT CAGCCATGCA GCCTGGCTCT TACCAGGTTG 1080 

20 GCTGCCTTTG CCTAGTTTCA GACGCAGQGA CAGAGCTCAT CGGGAAATGA AGGATATTTT 1 140 
CTATAAGGCA ATCC AG AAAC GCAGACAGTC TCA AGAAAAA ATTGATGACA TTCTCCAAAC 1 200 
TTTACTAGAT GCTACATACA AGGATGGGCG TCCTTTGACT GATGATGAAG TAGCAGGGAT 1260 
GCTTATTGGA TTACTCTTGG CAGGGCAGCA TACATCCTCA ACTACTAGTG CTTGGATGGG 1320 
CTTCTTTTTG GCCAGAGACA AAACACTTCA AAAAAAATGT TATTTAGA AC AGAAAACAGT 1 3 B0 

25 CTGTGGAGAG AATCTGCCTC CTTTAACTTA TGACCAGCTC AAGGATCTAA ATTTACTTGA 1440 
TCGCTGTATA AAAGAAACAT TAAGACTTAG ACCTCCrATA ATGATCATGA TGAGAATGGC 1500 
CAGAACTCCT CAGACTGTGG CAGGGTATAC CATTCCTCCA GGACATCAGG TGTGTGTTTC 1560 
TCCCACTG TC A A TCA A A G AC TTAAACJACfC AT0BGTAGAA C6CCTGGACT TTAATCCTGA 1620 
TCGCTACTTA CAGGATAACC CAGCATCAGG GGAAAAGTTT GCCTATGTGC CATTTGGAGC 1680 

30 TGGGCGTCAT CGTTGTATTG GGGAAAATTT TGCCTATGTT CAAATTAAGA CAATTTGGTC 1740 
CACTATGCTT CGTTTATATG AATTTGATCT CATTGATGGA TACTTTCCCA CTGTGAATTA 1800 
TACAACTATG ATTCACACCC CTGAGAACCC AGTTATCCGT TACAAACGAA GATCAAAAJG 1 860 
AAAAAGGTTG CAAGGAACGA ATATATGTGA TTATCACTQT AAGCCACAAA GGCATTCGAA 1920 
GAGAATGAAG TGTACAAAAC AACTCTTGTA GTTTACTGTT TTTTTAAGTG TGTAATTCTA 1980 

3 5 AAAGCCAGTT TATGATTTAG GATTTTGTTA ACTGA ATGGT TCTATCA AAT ATAATAGCAT 2040 
TTGACACATT TTCTA ATAGT TATGATACTT ATACATGTGC TTTCAGGAAG TTCCTTGGTG 2100. 
AAACAATTGT TGAGGGGGGA TCTAGGTAAT TGGCAGATTC TAAATAATAT AATTTCCAGA 2160 
TAGTAATTTT A AGAGTACTC ATCGCTCTTG CCAAATAAGT TCAGGGTATT CAAATCTTGG 2220 
ACTAGTCCTG CAAGGTATAA AGAATAAAAA TCCCAGTGAG ATACTTGGAA ACCACAGTTT 2280 

40 ATTATTATTT ATCTGGGCAA TTATTGTGTG TGTGAGGATG G AAGGGTAGG GA ATAATCGA 2340 
ACATCTAAAG CCTTGAATAA GAGAATACTA ATFGTTTTGG TATGATGATA CTCAGAAATG 2400 
GAGATATTAT AGGAAAAAGA AATCCTTTGG AATTTTAACT AAAATCACTG CATATGGGAA 2460 
ATTAAGAG AT CCAGGACCAT ATTTGATAAG AGTTCCTAAA AATAATGTAA TTATTAATGC 2520 
TAAAGACTGC TCATGTATCT TGATCTAATT ACTAAATAAA TTACATAT1T ATTTACCTGA 25B0 

45 TAAATATGTA TCrAGTTCTA CAAGGTCACA TTTATGTGGA AG'IT-C'AAAGT CAAGTCCTTA 2640 
GGGGATA ATT TTGTTTTGGG CTCAGTTGTT CCCTGCTTCC TTTTTTTTTT Tf i"TTTTTTT 2700 
TTGAGATGGA GTCTCGCTCT GTTGCCCAGG CTGGAGTGCA GTGGTGCGAT CTCAGCTCAC 2760 
TGCATCCTCT GCCTCCCGGG TTCAAGCAAT TCTCTGCCTC AGCCTCCCAA GTAGTTGGGA 2820 
TTACAGGCAC CTGCCACCAT GCCTGGCTAA TTTTTTGTAT TTTTAGTAGA GACGGGGGTT 2880 

50 TCACTATGTT GGCTAGGCTG GTCTTGAACT CCTGAGCCTC GTGAGTCCAC CCGCCTTGGC 2940 
CTCCCAAAGT GCTGGGATTA CAGGCATGAG CCACCGCACC TQGCCTTCCC TGCTTCCTCT 3000 
CTAGAATCCA ATTAGGGATG TTTGTTACTA CTCATATTGA TTAAAACAGTTAACAAACTT 3060 
TTTTCTTTTT AAAATGTGAG ATCAGTGAAC TCTGGTTTTA AGATAATCTG AAACAAGGTC 3120 
CTTGGGAGTA ATAAAATTGG TCACATTCTG TAAAGCACAT TCTGTTTAGG AATCAACTTA 3 ISO 

55 TCTCAAATTG TAACTCGGGG CCTAACTATA TGAGATGGCT GAAAAAATAC CACATCGTCT 3240 
GTTTTCACTA GGTGATGCCA AAATATTTTG CTTTATGTAT ATTACAGTTC TTTTTAAAAC 3300 
ACTGGAAGAC TCATGTTAAA CTCTAATTGT GAAGGCAGAA TCTCTGCTAA TTTTTCAGAT 3360 
TAAAATTCTC TTTGAAAAAA T 

60 Seq ID NO: 82 Protein sequence: 

Protein Accession #; NP_000777 

1 11 21 31 41 51 

6 5 M AAA AGMLLL GLLQAGGSVL GQAMEK.VTGG NLLSMLLIAC AFTLSLVYLI RLAAGHLVQL 60 
PAGVKSPPY1 FSPIPFLGHA IAFGKSPIEF LENAYEKYGP VFSFTMVGKT FTYLLGSDAA 1 20 
ALLFNSKNED LNAEDVYSRL TTPVFGKGVA YDVPNPVFLE QKKMLKSGLN IAHFKQHVSI 1 80 
IEKETKEYFE S WGESGEKNV FEALSELIIL TASHCLHGfCE tRSQLNEKVA QLYADLDGGF 240 
SHAAWLLPGW LPLPSFRRRD RAHRBIKD1F YKAIQKRRQS QEK1DD1LQT LLDATYKDGR 300 

70 PLTDDEVAGM LIGLLLAGQH TSSTTSAWMG FFLARDKTLQ KKCYLEQKTV CGENLPPLTY 360 
DQLKDLNLLD RCiKETLRLR PPIMIMMKMA RTPQTVAGYT IPPGHQVCVS PTVNQRLKDS 420 
WVERLDFNPD RYLQDNPASG EKFAYVPFGA GRHRCIGENF AYVQIKTIWS TMLRLYEFDL 480 
1DGYFPTVNY TTMiHTPENP VIRYKRRSK 

75 Scq ID NO: 83 DNA sequence 

Nucleic Acid Accession M: NM.00655 1 .2 

Coding sequence: 64-336 (underlined sequences correspond to start and stop cottons) 
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I 11 21 31 41 51 

I 1 1 1 I I 

AATTCTAGAA GTCCAAATCA CTCATTGTTT GTG AAAGCTG AGCTCACAGC AA AACAAGCC 60 
5 ACCATGAAGC TGTCGOTGTG TCTCCTGCTG GTCACGCTGG CCCTCTGCTG CTACCAGGCC 120 
AATGCCG AGT TCTGCCCAGC TCTTGTTTCT G AGCTGTTAG ACTTCTTCIT CATTAGTGA A 1 80 
CCTCTGTTCA AGTTAAGTCT TGCCAAATTT G ATGCCCCTC CGGAAGCTGT TGCAGCCAAG 240 
TTAGGAGTGA AGAGATGCAC GGATCAGATG TCCCTTCAGA AACGAAGCCT CATTGCGGAA 300 
GTCCTGGTGA AAATATTGAA GAAATGTAGT GTGTGACATG TAAAAACTTT CATCCTGGTT 360 
1 0 TCCACTGTCT TTCAATGACA CCCTGATCTT CACTGCAGAA TGTAAAGGTT TCAACGTCTT 420 
GCTTTAATAA ATCACTTGCT CTAC 

Seq ID NO: 84 Protein sequence: 
^ Protein Accession ft NP_006542.1 

1 11 21 31 41 5! 
I I I I I I 

MKLSVCLLLV TLALCCYQAN AEFCPALVSE LLDFFFISEP LFKLSLAKFD APPEAVAAKL 60 
GVKRCTDQMS LQKRSLIAEV LVK1LKKCSV 

Seq ID NO: 85 DNA sequence 

Nucleic Acid Accession ft NM.002317.1 

Coding sequence: 231-1484 (underlined sequences correspond to start and stop codons) 

25 1 11 21 31 41 51 
I I I I 

GGGCCAGGAC TGAGAAAGGG GAAAGGGAAG GGTOCCACGT CCGAGCAGCC GCCTTGACTQ 60 

. j ji 11 € rt \C C< TT p 11 JTGCiTtKiA GACTGAGA1 CGTGt ( 
GCTCGCCTCC TTGGTTGAAG ATTTCTCCTT CCCTCACGTG ATTTGAGCCC CGTTTTTATT 180 

3 0 TTCTGTGAGC CACGTCCTCC TCGAGCGGGG TCAATCTGGC A AAAGGAGTG ATGCGCTTCG 240 
CCTGGACCGT GCTCCTGCTC GGGCCTTTGC AGCTCTGCGC GCTAGTGCAC TGCGCCCCTC 300 
CCGCCGCCGG CCAACAGCAG CCCCCGCGCG AGCCGCCGGC GGCTCCGGGC GCCTGGCGCC 360 
AGCAGATCCA ATGGGAGAAC AACGGGCAGG TGTTCAGCTT GCTGAGCCTG GGCTCACAGT 420 
ACCAGCCTCA GCGCCGCCGG GACCCGGGCG CCGCCGTCCC TGGTGCAGCC AACGCCTCCG 4B0 

35 CCCAGCAGCC CCGCACTCCG ATCCTGCTGA TCCGCGACAA CCGCACCGCC GCGGGGCGAA 540 
CGCGGACGGC CGGCTCATCT GGAGTCACCG CTGGCCGCCC CAGGCGCACC GCCCGTCACT 600 
GGTTCCAAGC TGGCTACTCG ACATCTAGAG CCCGCGAAGC TGGGCCCTCG CGCGCGGAGA 66D 
ACCAGACAGC GCCGGGAGAA GTTCCTGCTC TCAGTAACCT GCGGCCGCCC AGCCGCGTGG 720 
ACGGCATGGT GGGCGACGAC CCTTACAACC CCTACAAGTA CTCTGACGAC AACCCTTATT 780 

40 ACAACTACTA CGATACTTAT GAAAGGCCCA GACCTGGGGG CAGGTACCGG CCCGGATACG 840 
G CACTGGCTA CTTCCAGTAC GGTCTCCCAG ACCTGGTGGC CGACCCCTAC TACATCCAGG 900 
CGTCCACGTA CGTGCAGAAG ATGTCCATGT ACAACCTG AG ATGCGCGGCG GAGGAAAACT 960 
GTCTGGCCAG TACAGCATAC AGGGCAGATG TCAGAGATTA TGATCACAGG GTGCTGCTCA 1020 
GATTTCCCCA AAGAGTGAAA AACCAAGGGA CATCAGATTT CTTACCCAGC CGACCAAGAT 1 0S0 

45 ATl'CCTGGGA ATGGCACAGT TGTCATC AAC ATTACCACAG TATGG ATG AG TTTAGCCACT 1 1 40 
TGTACCTGCT TGATGCCAAC ACCCAGAGGA GATGGGCTGA AGGCCACAAA GCAAGTTTCT 1200 
GTCTTGAAGA CACATCCTGT GACTATGGCT ACCACAGGCG ATTTGCATGT AGTGCACACA 1260 
CACAGGGATT GAGTCCTGGC TGTTATG ATA CCTATGGTGC AGACATAGAC TGCCAGTGGA 1320 
TTG ATATTAC AGATGTAAAA CCTGGAAACT ATATCGTAAA GGTCAGTGTA AACCCCAGCT 1 380 

50 ACCTGGTTCC TGAATCTGAC TATACCAACA ATGTTGTGCG CTGTGACATT CGCTACACAG 1440 
GACATCATGC GTATGCCTCA GGCTGCACAA TTTCACCGTA TTAGAAGGCA AAGCAAAACT 1500 
CCCAATGGAT AAATCAGTGC CTGGTGTTCT GAAGTGGGAA AAAATAGACT AACTTCAGTA 1 560 
GGATTTATGT ATTTTGAAAA AGAGAACAGA AAACAACAAA AGAATTTTTG TTTGGACTGT 1620 
TTTCAATAAC AAAGCACATA ACrGGATTTT GAACGCTTAA GTCATCATTA CTTGGGAAAT 1680 

55 TTTTAATGTT TATTATTTAC ATCACTTTGT GAATTAACAC AGTGTTTCAA TTCTGTAATT 1740 
ACATATTTGA CTCTTTCAAA AAAAAAAAAA AAAAAAAAAA 

Seq ID NO; 86 Protein sequence: 
Protein Accession #: NP_00230S.I 

60 

I II 21 31 41 51 
I I I I 1 1 

MRFAWTVIXL GPLQI.CALVH CAPPAAGQQQ PPREPPAAPG AWRQQIQWEN NGQVFSLLSL 60 
GSQYQPQRRR DPGAAVPGAA NASAQQPRTP ILLIRDNRTA AGRTRTAGSS GVTAGRPRPT 120 

65 ARHWFQAGYS TSRAREAGPS RAENQTAPGE VPALSNLRFP SRVDGMVGDD PYNPYKYSDD 130 
NPYYNYYDTY ERPRPGGRYR PG YGTGYFQY GLPDLVADPY YIQASTY VQK MSMYNLRCA A 240 
EENCLASTAY RADVRDYDHR VLLRFPQRVK NQGTSDPLFS RPRYSWEWHS CHQHYHSMDE 300 
FSHLYLLD AN TQRRWAEGHK ASFCLEDTSC DYGYHRKFAC TAHTQGLSPG CYDTYGADID 360 

rj q CQWIDITDVK PGNYILKVSV NPSYLVPESD YTNNVVRCDI RYTGHHAYAS GCTISPY 

Seq ID NO: 87 DNA sequence 

Nucleic Acid Accession*: NM_006419,1 

Coding sequence: 91 -420 {underlined sequences correspond to start and stop codons) 
75 I II 21 31 41 51 

TTCGGCACTT GGGAGAAGAT GTTTGAAAAA ACTGACTCTG CTAATGAGCC TGGACTCAGA 60 
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GCTCAAGTCT GAACTCTACC TCCAGACAGA ATGAAGTTCA TCTCGACATC TCTGCTTCTC 120 
ATGCTGCTGG TCAGCAGCCT CTCTCCAGTC CAAGGTGTTC TGGAGGTCTA TTACACAAGC 1 80 
TTGAGGTGTA GATGTGTCCA AGAGAGCTCA GTCTTTATCC CTAGACGCTT CATTGATCGA 240 
ATTCAAATCT TGCCCCGTGG G AATGGTTGT CCAAGAA A AG AAATCATAGT CTGGAAGAAG 300 
AACAAGTCAA TTGTGTGTGT GGACCCTCAA GCTGAATGGA TACAAAGAAT GATGGAAGTA 360 
TTGAGAAAAA GAAGTTCTTC AACTCTACCA GTTCCAGTGT TTAAGAGAAA GATTCCCTGA 420 
TGCTGATATT TCCACTAAG A ACACCTGCAT TCTTCCCTTA TCCCTGCTCT GGATTTTAGT 480 
TTTGTGCTTA GTTAAATCTT TTCCAGGGAG AAAGAACTTC CCCATACAAA TAAGGCATGA 540 
GGACTATGTG AAAAATAACC TTGCAGGAGC TGATGGGGCA AACTCAAGCT TCTTCACTCA 600 
CAGCACCCTA TATACACTTG GAGTTTGCAT TCTTATTCAT CAGGGAGGAA AGTTTCTTTG 660 
AAAATAGTTA TTCAGTTATA AGTAATACAG OATTATTTTG ATTATATACTTOTTGTTTAA 720 
TGTTTAAAAT TTC'ITAGAAA ACAATGGA AT C5AGAATTTAA GCCTCAAATT TGAACATGTO 780 
GCTTGAATTA AGAAGAAAAT TATGGCATAT ATTAAAAGCA GGCTTCTATG AAAGACTCAA 840 
AAAGCTGCCT GGGAGGCAGA TGGAACTTGA GCCTGTCAAG AGGCAAAGGA ATCCATGTAG 900 
TAG ATATCCT CTGCTTAA AA ACTCACTACG GAGGAG AATT AAGTCCTACT TTTA AAGAAT 960 
TTCTTTATAA AATTTACTGT CTAAGATTAA TAGCATTCGA AGATCCCCAG ACTTCATAGA 1020 
ATACTCAGGG AAAGCATTTA AAGGGTGATG TACACATGTA TCCTTTCACA CATTTGCCTT 1080 
G ACAA ACTTC TTTCACTCAC ATCTTTTTCA CTG ACTTTTT TTGTGGGGGC GGGGCCGGGG 1140 
GGACTCTGGT ATCTAATTCT TTAATGATTC CTATAAATCT AATGACATTC AATAAAGTTG 1200 
AGCAAACATT TTACTT 

Seq ID NO: 88 



Coding sequence: 3 7-477 (underlined sequences correspond lo start and stop codons) 
1 11 21 31 41 51 

CTTCTCTGGG ACACATTGCC TTCTGTTTTC TCCAGCATGC GCTTGCTCCA GCTCCTGTTC 60 
AGGGCCAGCC CTGCCACCCT GCTCCTGGTT CTCTGCCTGC AGTTGGGGGC CAACAAAGCT 120 
CAGGACAACA CTCGGAAGAT CATAATAAAG AATTTTGACA TTCCCAAGTC AGTACGTCCA 1 80 
AATGACG AAG TCACTGCAGT GCTTGCAGTT CAAACAG A AT TGAAAGAATG CATGGTGGTT 240 
AAAACTTACC TCATTAGCAG CATCCCTCTA CAAGGTGCAT TTAACTATAA GTATACTGCC 300 
TGCCTATGTG ACGACAATCC AAAAACCTTC TACTGGGACT TTTACACCAA CAGAACTGTG 360 
CAAATTGCAG CCGTCGTTGA TGTTATTCGG GAATTAGGCA TCTGCCCTGA TGATGCTGCT 420 
GTAATCCCCA TCAAAAACAA CCGGTTTTAT ACTATTGAAA XCCTAAAGGT AGAATAATGG 480 - 
AAGCCCTGTC TG'ITTGCCAC ACCCAfiCiTGA '1TTCCTCTAA AGAAACTTGG CTGGAATTTC 540 
TGCTGTGGTC TATAAAATAA ACTTCTTAAC ATGCTT 



MRLLQLLFRA SPATLLLVLC LQLGANKAQD NTRKIIKNF DIFKSVRPND EVTAVLAVQT 60 
ELKECM V VKT YL1SS1PLQG AFNYKYTACL CDDNPKTFYW DFYTNRTVQI AAWDVIREL 120 
GICPDDAAVI PIKNNRFYTI EILKV 

Seq ID NO: 91 DNA sequence 

Nucleic Acid Accession ft AK000341 

Coding sequence: 85-975 (underlined sequences correspond lo start and stop codons) 
I 11 2! 31 41 51 

GATAGCGCCG GGCAGAGGGA CCCGGCTACC CTGGACAGCG CATCGCCGCC CGCCCGGGTC 60 
GCCGCGCCAC AGCCGCTGCG GATCATGGAA CATCTAAAGG CCTTTGATGA TGAAATCAAT 120 
GCTTTTTTGG ACAATATGTT TGGACCGCGA GATTCTCGAG TCAGAGGGTG GTTCACGTTG 180 
GACTCTTACC TTCCTACCTT TTTTCTTACT GTCATGTATC TGCTCTCAAT ATGGCTGGGT 240 
AACAAGTATA TGAAGAACAG ACCTGCTCTT TCTCTCAGGG GTATCCTCAC CTTOTATAAT 300 
CTTGGAATCA CACTTCTCTC CGCGTACATG CTGGCAGAGC TCATTCTCTC CACTTGGGAA 360 
GGAGGCTACA ACTTACAGTG TCAAGATCTT ACCAGCGCAG GGGAAGCTGA CATCCGGGTA 420 
GCCAAGGTGC TTTGGTGGTA CTATTTCTCC AAATCAGTAG AGTTCCTGGA CACAATTTTC 480 
TTCGTTTTGC GGAAAAA AAC GAGTCAGATT ACTTTTCTTC ATGTATATCA TCATGCTTCT 540 
ATGTTTAACA TCTGGTGGTG TGTCTTGAAC TGGATACCTT GTGGACAAAG TTTCTTTGGA f30O 
CCAACACTGA ACAGTTTTGT CCACATTCTT ATGTACTCCT ACTATGGACT TTCTGTGTTT 660 
CCATCTATGC ACAAGTATCT TTGGTGGAAG AAATATCTCA CACAGGCTCA GCTGGTGCAG 720 
TTCGTGCTCA CCATCACGCA CACCATGAOC GCCGTCGTGA AACCGTGTGG CTTCCCCTTC 780 
GGTTGTCTCA TCTTCCAGTC ATCTTATATG CTAACGTTAG TCATCCTCTT CTTAAATTTT 840 
TATGTTCAGA CATACCGAAA AAAGCCAATG AAGAAAGATA TGCAAGAGCC ACCTGCAGGG 900 
AAAGAAGTGA AGAATGGTTT TTCCAAAGCC TACTTCACTG CAGCAAATGG AGTGATGAAC 960 
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AAGAAAGCAC AATAAAAATG AGTAACAGAA AAAGCACATA TACTAGCCTA ACAGATTGGC 1020 
TTGTTTTAAA GCAAAGACTG AATTGAAGGT TACATGTTTT AGGATAAACT AATTTCTTTT 1080 
GAGTTCATAA ATCATTTGTA CGCAGAATGT ATTAATATAT TGCTATTAGG TTAATCTGTT 1 140 
AACTGAATGC TTTGATCAGC ATTGAGGTGA TGCTCACCTC CGAGGACCTC AGAACTGGTG 1200 
5 CAGCTTCTCT CTCCCTCCCT CCCACAG ACT GAACCTTTCG CCAG A AGCTG TCCTT ATAAC 1260 
GCCTTATACG CATACACAGC CAGGAAACGT GGAGCATTGT TTCTCACAGA GAGTCTCCAA 1320 
ATAAAAAGGG TTTTGTTCAG ATTAAAATGT TTACA ACAAA ATGTTAATTA TATTCTAAAT 13B0 
ACAGGGTATG TTCTAATCTA TATTAAGCAA TAATGCCAGT GCATAATCAT TCCATTTGTT 1 440 
CCTTTAGCAA TCAACCGCAG AAAATATTAA AATGGGATCA TACACAGAAG ATAGAAAAAT 1500 

1 0 CTAGCAAAAC TTCTCTTTCT GTAAGCCAGA GTCTTGTCTA TCAOATTCCC ACAACCACTC 1560 
CTGATTCTA/ \Tl T AGTGAS ATGGTAATGA AATTGGTATT TA'JTITAAAT ATTAGTTATT 1620 
CTAAGGAGAA AAAAATGCTT CTGCAAGATT TTCATAATTC AGGGGCTGTG GATAGGATl'G 1 680 
TTCCTCTGTT TCCCTAATCA TTCATCTGTT CATGTCTCCC TCTTGTGCCA GTCAGCCTAG 1740 
GTTATACAGA TGCCATGCTC CACACCACGA GCAGTGTACA AATCTGGCTG CCCGTTTACT 1800 

15 TTCTGAGCAA GCACTGGAGT CCACTCCGAC CTTTITCTTT GAACATGCAT GCTGCTGGAA I860 

TATGTATAAA TCAGAACTAG CAGAAGTAGC AGAGTGATGG GAGCAAAATA GGCACTGAAT 1920 
TCGTCA ACTC TTTTTTGTG A GCCTACTTGT G AATATTACC TCAG ATACCT GTTGTCACTC 1980 
TTCACAGGTT ATTTAAGTTC TTGAAGCTGG GAGGAAAAAG ATGGAGTAGC TTGGAAAGAT 2040 
TCCAGCACTG AGCCGTGAGC CGGTCATGAG CCACGATAAA AAATGCCAGT TTGGCAAACT 2100 

20 CAGCACTCCT GTTCCCTGCT CAGGTATATO CGATCTCTAC TGAGAAGCAA GCACAAAAGT 2160 
AGACCAAAGT ATTAATGAGT ATTTCCTTTC TCCATAAGTG CAGGACTGTT ACTCACTACT 2220 
AAACTCTACC AAGAATGGAA ACCAAGAATA TTTTCTGAAG ATTTTTTTGA AGATTAATTT 2280 
ATACCCTATA AAATAAAACT TGTTAGCTTC GATGAAGTCA AAAAAAAAAA AAAAAAAAAA 

25 Scq ID NO: 92 Protein sequence: 

Protein Accession**: BAA91096.1 

1 11 21 31 41 SI 

30 MEHLKAFDDE MAFLDNMFG PRDSRVRGWF TLDSYLPTFF LTVMYIXSIW LGNKYMKNRP 60 
ALSLRGILTL YNLGITLLSA YMLAELILST WEGGYNLQCQ DLTSAGEADI RVAKVLWWYY 1 20 
FSK.SVEFLDT IFFVLRKKTS QITFLHVYHH ASMFNIWWCV LNW1PCGQSF FGPTLNSFVH 180 
' 1LMYSYYGLS VFPSMHKYLW WKKYLTQ AQL VQFVLTITHT M5AWKPCGF PFGCL1FQSS 240 
^ ^ YMLTLVILFL NFYVQTYRKK PMKKDMQEPP AGKEVKNGFS KAYFTAANGV MNKKAQ 

Scq ID NO: 93 DNA sequence 

Nucleic Acid Accession #: NM.000044 

Coding sequence: 1 1 15-3874 (underlined sequences correspond So start and stop codDns) 

40 1 11 21 31 41 51 

CGAGATCCCG GGGAGCCAGC TTOCTGGGAG AGCGGGACGG TCCGGAGCAA GCCCACAGGC 60 
AGAGGAGGCG ACAGAGGGAA AAAGGGCCGA GCTAGCCGCT CCAGTGCTGT ACAGGAGCCG 120 
AAGGGACGCACCACGCCAGCCCCAGCCCGGCTCCAGCGACAGC-C-l/'i GC< ;OTGCAGCG 180 

45 CGGCGGCTTCG\ tu C < CCGGAGCTG CCCTTTCCTC TTCGGTGAAG TTOTAAAAG 240 
CTGCTAAAGA CTCGGAGGAA GCAAGGAAAG TGCCTGGTAG GACTGACGGC TGCCTTTGTC 300 
CTCCTCCTCT CC ACCCCGCC TCCCCCCACC CTGCCTTCCC CCCCTCCCCC GTCTTCTCTC 360 
CCGCAGCTGC CTCAGTCGGC TACTCTCAGC CAACCCCCCT CACCACCCrT CTCCCCACCC 420 
GCCCCCCCGC CCCCGTCGGC CCAGCGCTGC CAGCCCGAGT TTGCAGAGAG GTAACTCCCT 480 

50 TTGGCTGCGA GCGGGCGAGC TAGCTGCACA TTGCAAAGAA GGCTCTTAGG AGCCAGGCGA 540 
CTGGGGAGCG GCTTCAGCAC TGCAGCCACG ACCCGCCTGG TTAGAATTCC GGGGGAGAGA 600 
ACCCTCTGTT TTCCCCCACT CTCTCTCCAC CTCCTCCTGC CTTCCCCACC CCGAGTGCGG 660 
AGCAGAGATC AAAAGATGAA AAGGCAGTCA GGTCTTCAGT AGCCAAAAAA CAAAACAAAC 720 
AAAAACAAAA AAGCCGAAAT AAAAGAAAAA GATAATAACT CAGTTCTTAT TTGCACCTAC 730 

5 5 TTCAGTGGAC ACTGAATTTG GAAGGTGGAG GATTTTGTTT TTTTCTTTTA AG ATCTGGGC 840 
ATCTTTTGAA TCTACCCTTC AAGTATTAAG AGACAGACTG TGAGCCTAGC AGGGCAGATC 900 
TTGTCCACCG TGTGTCTTCT TCTGCACGAG ACTTTGAGGC TGTCAGAGCG CTTTTTGCGT 960 
GGTTGCTCCC GCAAGTTTCC TTCTCTGGAG CTTCCCGCAG GTGGGCAGCT AGCTGCAGCG 1020 
ACTACCGCAT CATCACAGCC TGTTGAACTC TTCTGAGCAA GAGAAGGGGA GGCGGGGTAA 1080 

60 GGGAAGTAGG TGGAAGATTC AGCCAAGCTC AAGGATGGAA GTGCAG'ITAU GGCTGGGAAG H40 
iGTCTACCCT 1' CCAAGACCT ITCCAG If 120C 

CGTGCGCGAA GTGATCCAGA ACCCGGGCCC CAGGCACCCA GAGGCCGCGA GCCOAGCACC 1263 
TCCCGGCGCC AGTTTGCTGC TGCTGCAGCA GCAGCAGCAG CAGCAGCAGC AGCAGCAGCA 1320 
GCAGCAGCAG CAGCAGCAGC AGCAGCAAGA GACTAGCCCC AGGCAGCAGC AGCAGCAGCA 1380 

65 GGGTGAGGAT GGTTCTCCCC AAGCCCATCG TAGAGGCCCC ACAGGCTACC TGGTCCTGGA 1440 
TGAGGAACAG CAACCTTCAC AGCCGCAGTC GGCCCTGGAG TGCCACCCCG AGAGAGGTTG 1500 
CGTCCCAGAG CCTGGAGCCG CCGTGGCCGC CAGCAAGGGG CTGCCGCAGC AGCTG CCAGC 1 560 
ACCTCCGGAC G AGG ATGACT CAGCTGCCCC ATCCACGTTG TCCCTGCTGG GCCCCACTTT 1620 
CCCCGGCTTA AGCAGCTGCT CCGCTGACCT TAAAGACATC CTGAGCGAGG CCAGCACCAT 1680 

70 GCAACTCCTT CAGCAACAGC AGCAGGAAGC AGTATCCGAA GGCAGCAGCA GCGGGAGAGC 1740 
GAGGGAGGCC TCGGGGGCTC CCACTTCCTC CAAGGACAAT TACTTAGGGG GCACTTCGAC 1800 
CATTTCTGAC AACGCCAAGG AGTTGTGTAA GGCAGTGTCG GTGTCCATGG GCCTGGGTGT I860 
GGAGGCGTTG GAGCATCTGA GTCCAGGGGA ACAGCTTCGG GGGGATTGCA TGTACGCCCC 1920 
ACTTITGGGA GTTCCACCCG CTGTGCGTCC CACTCCTTGT GCCCCATTGG CCGAATGCAA 1980 

75 AGGTTCTCTG CTAGACGACA GCGCAGGCAA GAGCACTGAA GATACTGCTG AGTATTCCCC 2040 
TTTCAAGGGA GGTTACACCA AAGGGCTAGA AGGCGAGAGC CTAGGCTGCT CTGGCAGCGC 2100 
TGCAGCAGGG AGCTCCGGGA CACTTGAACT GCCGTCTACC CTGTCTCTCT ACAAGTCCGG 2160 
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AGCACTGGAC CAGGCAGCTG CGTACCAGAG TCGCGACTAC TACAACTTTC CACTGGCTCT 2220 
GGCCGGACCG CCGCCCCCTC CGCCGCCTCC CCATCCCCAC GCTCGCATCA AGCTGGAGAA 2280 
CCCGCTGGAC TACGGCAGCG CCTGGGCGGC TGCGGCGGCG CAGTGCCGCT ATGGGG ACCT 2340 
GGCGAGCCTG CATGGCGCGG GTGCAGCGGG ACCCGGTTCT GGGTCACCCT CAGCCGCCGC 2400 
TTCCTCATCC TGGCACACTC TCTTCACAGC CGAAGAAGGC CAGTTGTATG GACCGTGTGG 2460 
TGGTGGTGGG GGTGGTGGCG GCGGCGGCGG CGGCGGCGGC GGCGGCGGCG GCGGCGGCGG 2520 
CGGCGGCGGC GAGGCGGGAG CTGTAGCCCC CTACGSCTAC ACTCGGCCCC CTCAGGGGCT 2580 
GGCGGGCCAG GAAAGCGACT TCACCGCACC TGATGTGTGG TACCCTGGCG GCATGGTGAG 2640 
CAGAGTGCCC TATCCCAGTC CCACTTGTGT CAAAAGCGAA ATGGGCCCCT GGATGGATAG 2700 
CTACTCCGGA CCTTACGGGG ACATGCGTTT GGAGACTGCC AGGGACCATG TTTTGCCCAT 2760 
TGACTATTAC TTTCCACCCC AGAAGACCTG CCTGATCTGT GGAGATGAAG CTTCTGGGTG 2820 
TCACTATGOA GCTCTCACAT GTGGAAGCTG CAAGGTCTTC TTCAAAACAU CCGCTGAAGG 2880 
GAAACAGAAG TACCTGTGCG CCAGCAGAAA TGATTGCACT ATTGATAAAT TCCGAAGGAA 2940 
AAATTGTCCA TCTTGTCGTC TTCGGAAATG TTATGAAGCA GGGATGACTC TGGG AGCCCG 3G00 
GAAGCTGAAG AAACTTGGTA ATCTGAAACT ACAGGAGGAA GGAGAGGCTT CCAGCACCAC 3060 
CAGCCCCACT GAGGAGACAA CCCAGAAGCT GACAGTGTCA CACATTGAAG GCTATGAATG 3120 
TCAGCCCATC TTTCTGAATG TCCTGGAAGC CATTGAGCCA GGTGTAGTGT GTGCTGGACA 3180 
CGACAACAAC CAGCCCGACT CCTTTGCAGC CTTGCTCTCT AGCCTCAATG AACTGGGAGA 3240 
GAGACAGCTT GTACACGTGG TCAAGTGGGC CAAGGCCTTG CCTGGCTTCC GCAACTTACA 3300 
CGTGGACGAC CAGATGGCTG TCATTCAGTA CTCCTGGATG GGGCTCATGG TGTTTGCCAT 3360 
GGGCTGGCGA TCCTTCACCA ATGTCAACTC CAGG ATGCTC TACTTCGCCC CTGATCTGGT 3420 
TTTCAATGAG TACCGCATGC ACAAGTCCCG GATGTACAGC CAGTGTGTCC GAATGAGGCA 3480 
CCTCTCTCAA GAGTTTGGAT GGCTCCAAAT CACCCCCCAG GAATTCCTGT GCATGAAAGC 3540 
ACTGCTACTC TTCAGCATTA TTCCAGTGGA TGGGCTGAAA AATCAAAAAT TCTTTGATGA 3600 
ACTTCGAATG AACTACATCA AGGAACTCGA TCGTATCATT GCATGCAAAA GAAAAAATCC 3660 
CACATCCTGC TCAAGACGCT TCTACCAGCT CAGCAAGCTC CTGGACTCCG TGCAGCCTAT 3720 
TGCGAGAGAG CTGCATCAGT TCACTTTTGA CCTGCTAATC AAGTCACACA TUGTGAGCGT 3780 
GGACTTTCCG GAAATGATGG CAGAGATCAT CTCTG TfiCAA GTGCCCAAGA TCCTTTCTGG 3840 
GAAAGTCAAG CCCATCTATT TCCACACCCA GTGAAGCATT GGAAACCCTA TT'l CCCCACC 3900 
CCAGCTCATG CCCCCTTTCA GATGTCTTCT GCCTGTTATA ACTCTGCACT ACTCCTCTGC 3960 
AGTGCCTTGG GGAATITCCT CTATTGATGT ACAGTCTGTC ATGAACATGT TCCTGA ATTC 4020 
TATJTGCTGG GCTTTrTTTT TCTCTTTCTC TCCTTTCTTT TTCTTCTTCC CTCCCTATCT 4080 
AACCCTCCCA TGGCACCTTC AGACTTTGCT TCCCATTGTG GCTCCTATCT GTGTTTTGAA 4140 
TGGTOTTGTA TGCCTTTAAA TCTGTGATGA TCCTCATATG GCCCAGTGTC AAGTTGTGCT 4200 
TGTTTACAGC ACTACTCTGT GCCAGCCAC A CAA ACGTTTA CTTATCTTAT GCCACGGGAA 4260 
GTTTAGAGAG CTAAGATTAT CTGGGGAAAT CAAAACAAAA AACAAGCAAA CAAAAAAAAA 4320 



1 11 21 31 41 51 

1 I I I ! I 

MEVQLGLGRV YPRPPSKTYR GAFQNU-'QSV REVJQNPOIMt 11PEAARAAW GASLLI.l.QQQ 60 
QQQQQQQQQQQQQQQQQQETSPRQQQQQQG EDGSI'QAHRR UPTGYLV1.uk HQQl'SQl'QSA 120 
LECHPERGCV PEPGAAVAAS KGLPQQLPAP PDEDDSAAPS TLSLLGPTI ; P GI.SSCSADLK ISO 
DILSEASTMQ LLQQQQQEAV SEGSSSGRAR EASGAPTSSK DNYLGGTSTI 5DNAKELCKA 240 
VSVSMGLG VE ALEHLSPGEQ LRGDCMYAPL LGVPPAVRPT PCAPLAECKG SLLDDSAGKS 300 
TEDTAEYSPF KGGYTKGLEG ESLGCSGSAA AGSSGTLELP STLSLYKSGA LDEAAAYQSR 360 
DYYNFPLALA GPPPPPPPPK PHARIKLENP LDYGSAWAAA AAQCRYGDLA SLHGAGAAGP 420 
GSGSPSAAAS SSWHTLFTAE EGQLYGPCGG GGGGGGGGGG GGGGGGGGGG GGEAGAVAPY 480 
GYTRPPQGLA GQESDFTAPD VWYPGGMVSR VPYPSPTCVK SEMGPWMDSY SGPYGDMRLE 540 
TARDHVLPID YYFPFQKTCL 1CGDEASGCK YG ALTCGSCK VFFKRAAEGK QKYLCASRND 600 
CTIDKFRKKN CPSCRLRKCY EAGMTLGARK LKKLGNLKLQ EEGEASSTTS PrEETTQKXT 660 
VSH1EGYECQ PIFLNVLEAI EPG VVCAGHD NNQPDSFAAL LSSLNELGER QLVHVVKWAK 720 
ALPGFRNLHV DDQMAVIQYS WMGLMVFAMG WRSFTNVNSR MLYFAPDLVF NEYRMHKSRM 780 
YSQCVRMRHL SQEFGWLQ1T PQEFLCMKAL LLFSI1PVDG LKNQKFFDEL RMNYKELDR 840 
UACKKKNPT SCSRRFYQLT KLLDSVQP1A RELHQFTFDL LIKSI-MVSVD FPEMMAEHS 900 
VQVPK1LSGK VKPIYFHTQ 

SeqIDNO:95 DNA sequence 

Nucleic Acid Accession It NM_002497 

Coding sequence: 1 35-1472 {underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I 1 I 

GGCACGAGTA GGGGTGGCGG GTCAGTGCTG CTCGGGGGCT TCTCCATCCA GGTCCCTGGA 60 
GTTCCTGGTC CCTGGAGCTC CGCACTTGG C GCGCA ACCTG CGTGAGGCAG CGCGACTCTG 120 
GCG ACTGGCC GGCCATGCCT TCCCGGGCTG AGG ACTATG A AGTGTTGTAC ACCATTGGCA 1 80 
CAGGCTCCTA CGGCCGCTGC CAGAAGATCC GGAGGAAGAG TGATGGCAAG ATATTAGTTT 240 
GGAAAGAACT TGACTATGGC TCCATGACAG AAGCTGAGAA ACAGATGCTT GTTTCTGAAG 300 
TGAATTTGCT TCGTGAACTG A AACATCCAA AGATCGTTCG TTACTATG AT CGGATTATTG 360 
ACCGGACCAA TACAACACTG TACATTGTAA TGGAATATTG TGAAGGAGGG GATCTGGCTA 420 
GTGTAATTAC AAAGGGAACC AAGGAAAGGC AATACTTAGA TGAAGAGTTT GTTCTTCGAG 480 
TGATGACTCA GTTGACTCTG GCCCTGAAGG AATGCCACAG ACGAAGTGAT GGTGGTCATA 540 
CCGTATTGCA TCGGGATCTT AAACCAGCCA ATGTTTTCCT GGATGGCAAG CAAAACGTCA 600 
AGCTTGGAGA CTTTGGGCTA GCTAGAATAT TAAACCATGA CACGAGTTTT GCAAAAACAT 660 
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TTGTTOGCAC ACCTTATTAC ATGTCTCCTG A ACAA ATGAA TCGCATGTCC TACA ATG AG A 720 
AATCAGATAT CTGGTCATTG GGCTGCTTGC TGTATGAGTT ATGTGCATTA ATGCCTCCAT 78a 
TTACAGCTTT TAGCCAGAAA GAACTCGCTG GGAAAATCAG AGAAGGCAAA TTCAGGCGAA 840 
TTCCATACCG TTACTCTGAT GAATTGAATG AAATTATTAC OAGGATGTTA AACTTAAAGG SCO 
5 ATTACCATCG ACCTTCTGTT G AAG A AATTC TTG AG AACCC TTTA ATAGCA G ATTTGGTTG 960 

CAGACGAGCA AAGAAGAAAT CTTGAGAGAA GAGGGCGACA ATTAGGAGAG CCAGAAAAAT 1020 
CGCAGGATTC CAGCCCTGTA TTGAGTGAGC TGAAACTGAA GGAAATTCAG TTACAGGAGC I08D 
GAGAGCGAGC TCTCAAAGCA AGAGAAGAAA GATXGGAGCA GAAAGAACAG GAGCTTTGTG 1 140 
TTCGTGAGAG ACTAGCAGAG GACAAACTGGCTAGAGCAGA AAATCTGTTO AAGAACTACA 1200 

10 GCTTGCTAAA GGAACGGAAG TTCCTGTCTC TGGCAAGTAA TCCAGAACTT CTTAATCTTC 1260 
CATCCTCAGT AATTAAGAAG AAAGTTCATT TCAGTGGGGA AAGTAAAGAG AACATCATGA 1320 
CiCiAGTGAGAA TTCTGAGAGT CAGCTCACAT CTAAGTCCAA GTGCAAGGAC CTGAAGAAAA 1330 
GGCTTCACGC TGCCCAGCTG CGGGCTCAAG CCCTGTCAGA TATTGAGAAA AATTACCAAC 1440 
TGAAAAGCAG ACAGATCCTG GGCATGCGCTAGCCAGGTAG AGAGACACAG AGCTGTGTAC 1500 

15 AGGATGTAAT ATTACCAACC TTTAAAGACT GATATTCAAA TGCTGTAGTG TTGAATACTT 1S60 
GGCCCCATGA GCCATGCCTT TCTGTATAGT ACACATGATA TTTCGGAATT GGTTTTACTG 1 620 
TTCTTCAGCA ACTATTGTAC AAAATGTTCA CATTTAATTT TTCTTTCTTC TTTTAAGAAC 1 680 
ATATTATAAA AAGAATACTT TCTTGGTTGG GCTTTTAATC CTGTGTGTGA TTACTAGTAG 1740 
GAACATG AG A TGTG ACATTC TAAATCTTGG GAGAAAAAAT AATATTAGG A A AAAAATATT 1 800 

20 TATGCAGGAA OAGTAGCACT CACTGAATAG TTTTAAATGA CTGAGTGGTA TGCTTACAAT 1 SSO 
TGTCATGTCT AGATTTAAAT TTTAAGTGTG AGATTTTAAA TGTTTTTGAG CTTAGAAAAC 1920 
CCAGTTAGAT GCAATTTGGT CATTAATACC ATGACATCTT GCTTATAAAT ATTCCATTGC 19S0 
TCTGTAGTTC AAATCTGTTA GCTTTGTGAA AATTCATCAC TGTGATGTTT GTATTCTTTT 2040 
TTTTTTTCTG TTTAACAGAA TATGAGCTGT CTGTCATTTA CCTACTTCTT TGCCACTAAA 2100 

25 TAAAAGAATT CTTCAGTTA 

Scd. ID NO; 96 Protein sequence: 
Protein Accession if. NP 002483 

30 1 11 21 31 41 51 
1 I I I I I 

G1REFNPNM ANEVERJNMf TOSISGENEA RELATEDKIN ASEHOMOSAP 1ENSMPSRAE 60 
DYEVLYT1GT GSYGRCQKIR RKSDGKILVW KELDYGSMTE AEKQMLVSEV NLLRELKHPN 120 
IVRYYDRIID R.TNTTLY1 VM EYCEGGDLAS VITKGTKERQ YLDEEFVLRV MTQLTLALKE 1 B0 

35 CHRRSDGGHT VLHRDLKPAN VFLDGKQNVK. LGDFGLARIL NHDTSFAKTF VGTPYYMSPE 240 
QMNRM5YNEK SDIWSLGCLL YELCALMFPF TAFSQKELAG KJREGKFRRI PYRYSDELNE 300 
ITTRMLNLKD YHRPSVEEIL ENPLIADLV A DEQRRNLERR GRQLGEPEKS QDSSPVLSEL 360 
KLKE1QLQER ERALKAREER LEQfCEQBLCV RERLAEDKLA RAENLLKNYS LLKERKFLSL 420 
ASNFELLNLP SSVIKKK.VHF SGESKENQvlR SENSESQLTS KSK.CKDLKKR LHAAQLRAQA . 480 

40 LSDIEKNYQL KSRQILGMR 

Scq ID NO: 97 DMA sequence 
Nucleic Acid Accession ft NM_007050.2 
^ Coding sequence: 1 85-4576 (underlined sequences correspond io start and stop codons) 

1 II 21 31 41 51 
1 I I 1 I I 

CCTCCCGCCT CAGTTCGCGC CGCGCCTCGG CTTGGAACGC AGGAGCGCCG GCTCCGGGAG 60 
CCCGAGCGGA GCCAGCCGCG CGCACAGCCA GCGGCCGCGC CGGCGATGCG GGGCCACCCC 120 

5 0 GCGCCCGCCC CAGTCCCGGC CCCGGCCCCC GCGGGAAGGG GCTGAGCTGC CCGCCGCCGC 1 80 
CCGGATGGCG AGCCTCGCCG CGCTCGCCCT CAGCCTGCTC CTGAGGCTGC AGCTGCCGCC 240 
ACTGCCCGGC GCCCGGGCTC AGAGCGCCCC AGGTGGCTGT TCCTTTGATG AGCACTACAG 300 
CAACTGTGGT TATAGTGTOG CTCTAGGGAC CAATGGGTTC ACCTGGGAGC AGATTAACAC 360 
AACGGAGAAA CCA ATGCTGG ACCAGGCAGT GCCCACAGG A TCTTTCATGA TGGTGA ACAG 420 

5 5 CTCTGGGAGA GCCTCTGGCC AGAAGGCCCA CCTTCTCCTG CCAACCCTQA AGGAG AATGA 4SD 
CACCCACTGC ATCGACTTCC ATTACTACTT CTCCAGCCGT GACAGGTCCA GCCCAGGGGC 540 
CTTGAACGTC TACGTGAAGG TGAATGGTGG CCCCCAAGGG AACCCTGTGT GGAATGTGTC 600 
CGGGGTCGTC ACTGAGGGCT GGGTGAAGGC AGAGCTCGCC ATCAGCACTT TCTGGCCACA 660 
TTTCTATCAG GTGATATTTG AATCCGTCTC ATTGAAGGGT CATCCTGGCT ACATCGCCGT 720 

60 GGACGAGGTC CGGGTCCTTG CTCATCCATG CAGAAAAGCA CCTCATTTTC TGCGACTCCA 780 
AAACC jTGGAG GTGAATGTGG GGCAG A ATGC CACA fCAG '1 CATTGi i GTCC \A I 41 
GTCTCAGCAT g acaagc ttt ggctccagca atggaatggc AGGGACACGG CCCTGATGGT 900 
CACCCGTGTG GTCAACCACA GGCGCTTCTC AGCCACAGTC AGTGTGGCAG ACACTGCCCA 960 
GCGGAGCGTC AGCAAGTACC GCTGTGTGAT CCGCTCTG AT GGTGGGTCTG GTGTGTCCAA 1020 

65 CTACGCGGAG CTGATCGTGA AAGAGCCTCC CACGCCCATT GCTCCCCCAG AGCTGCTGGC 1 080 
TGTGGGGGCC ACATACCTGT GGATCAAGCC AAATGCCAAC TCCATCATCG GGGATGGCCC 1 140 
CATCATCCTG AAGGAAGTGG AATATCGCAC CACCACAGGC ACGTGGGCAG AGACCCACAT 1200 
AGTCGACTCT CCCAACTATA AGCTGTGGCA TCTGGACCCC GATGTTGAGT ATGAGATCCG 1260 
AGTGCTCCTC ACACGACCAG GTGAGGGGGG TACGGGACCG CCAGGGGCTC CCCTCACCAC 1320 

70 CAGGACCAAG TGTGCAGATC CGGTACATGG CCCACAGAAC GTGGAAATGG TAGACATCAG 1 380 
AGCCCGGCAG CTGACCCTGC AGTGGGAGCC CTTCGGCTAC GCGGTGACCC GCTGCCATAG 1440 
CTACAACCTC ACCGTGCAGT ACCAGTATGT GTTCAACCAG CAGCAGTACG AGGCCGAGGA 1500 
GGTCATCCAG ACCTCCTCCC ACTACACCCT GCGAGGCCTG CGCCCCTTCA TGACCATCCG 1560 
GCTGCGACTC TTGCTGTCTA ACCCCGAGGG CCGAATGGAG AGCGAGGAGC TGGTGGTGCA 1620 

75 GACTGAGGAA GACGTTCCAG GAGCTGTTCC TCTAQAATCC ATCCAAGGGG GGCCCTTTGA 1 6S0 
GGAGAAGATC TACATCCAGT GGAAACCTCC CAATGAGACC AATGGGGTCA TCACGCTCTA 1740 
CGAGATCAAC TACAAGGCTG TCGGCTCGCT GGACCCAAGT GCTGACCTCT CGAGCCAGAG 1800 
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GOGGAAAGTG TTCAAGCTCC GGAATGAAAC CCACCACCTC TTTGTGGGTC TGTACCCAGG 1860 
QACCACCTAT TCCTTCACCA TCAAGGCCAG CACAGCAAAG GGCTTTGGGC CCCCTGTCAC 1920 
CACTCGGATT GCCACCAAAA TTTCAGCTCC ATCCATGCCT GAGTACGACA CAGACACCCC 1980 
ATTGAATGAG ACAGACACGA CCATCACAGT GATGCTGAAA CCCGCTCAGT CCCGGGGAGC 2040 
5 TCCTGTCAOT GTTTATCAGC TGGTTGTCAA GGAGGAGCGA CTTCAGAAGT CACGGAGGGC 2 100 
AGCTGACATT ATTGAGTGCT TTTCGGTGCC CGTG AGCTAT CGGAATGCCT CCAGCCTCGA 2160 
TTCTCTACAC TACTTTGCTG CTGAGTTGAA GCCTGCCAAC CTGCCTGTCA CCCAGCCATT 2220 
TACAGTGGGT GACAATAAGA CATACAATGG CTACTGGAAC CCTCCTCTCT CTCCCCTGAA 2280 
AAGCTACAGC ATCTACTTCC AGGCACTCAG CAAAGCCAAT GGAGAGACCA AAATCAACTG 2340 

1 0 TGTTCGTCTG GCTACAAAAG CACCAATGGG CAGCGCCCAG GTG ACCCCGG GGACTCCACT 2400 
CTGCCTCCTC ACCACAGGTG CCTCC ACCC A GAA1TCTA AC AC TGTGGAGC CAGAGAAGCA 2460 
GfiTGGACAAC ACCGTGAAGA TGGCTGGCGT G a CC I 1 dill tC YG1 1 r AiCA fCAT 2520 
TCTCCTGGGC GTGATGCTCA CCATCAAAAG GAGAAGAAAT GCTTATTCCT ACTCCTATTA 2580 
CTTGTCCCAA AGGAAGCTGG CCAAGAAGCA GAAGGAGACC QAGAGTGGAG CCCAGAGGGA 2640 

1 5 GATGGGGCCT GTGGCCTCTG CCGACAAACC CACCACCAAG CTCAGCGCCA GCCGCAATGA 27O0 
TGAAGGCTTC TCTTCTAGTT CTCACGACGT CAACGGATTC ACAGATGGCA GCCGCGGGGA 2760 
GCTTTCCCAG CCCACCCTCA CGATCCAGAC TCATCCCTAC CGCACCTGTG ACCCTGTGGA 2820 
GATGAGCTAC CCCCGGGACC AGTTCCAACT CGCCATCCGG GTGGCTGACT TGCTGCAGCA 2880 
CATCACGCAG ATGAAGAGAG GCCAGGGCTA CGGGTTCAAG GAGGAATACG AGGCCTTACC 2940 

20 AGAGGGGCAG ACAGCTTCGT GGGACACAGG CAAGGAGGAT GAAAACCGCA ATAAGAATCG 3000 
ATATGGGAAC ATCATATCCT ACGACCATTC CCGGGTGAGG CTGCTGGTGC TGGATGGAGA 3Q60 
CCCQCACTCT GACTACATCA ATGCCAACTA CATTGACGGA TACCATCGAC CTCGGCACTA 3 120 
CATTGCG ACT CAAGGTCCG A TGCAGG AGAC TGTAAAGG AC TTTTGG AGAA TGATCTGGCA 3 1 80 
GGAGAACTCC GCCAGCATCG TCATGGTCAC AAACCTGGTG GAAGTGGGCA GGGTGAAATG 3240 

25 TGTGCGATAC TGGCCAGATG ACACGGAGOT CTACGGAGAC ATTAAAGTCA CCCTGATTGA 3300 
AACAGAGCCC CTGGCAOAAT ACGTCATACG CACCTTCACA GTCCAGAAGA AAGGCTACCA 3360 
TGAGATCCGG GAGCTCCGCC TCTTCCACTT CACCAGCTGG CCTGACCACG GCGTTCCCTG 3420 
CTATGCCACT GGCCD CTGC! GCITCGTCCG CCAGGTCAAG TTCCTCAACC CCCCGGAAGC 3480 
TGGGCCCATA OTGGTCCACT GCACjTGCTGG GGCTGGGCGG ACf GCl ~> k 1 >< ' .0 

3 0 TG ACACCATG C1TGACATGG CCG AG AATGA AGGGGTGGTG GACATCTTCA ACTGCGTGCG 3600 
TGAGCTCCGG GCCCAAAGGG TCAACCTGGT ACAGACAGAG GAGCAATATG TGTTTGTGCA 3660 
CGATGCCATC CTGGAAGCGT GCCTCTGTGG CAACACTGCC ATCCCTGTGT GTGAGTTCCG 3720 
TTCTCTCTAC TACAATATCA GCAGGCTGGA CCCCCAGACA AACTCCAGCC AAATCAAAGA 3780 
TGAATTTCAG ACCCTCAACA TTGTGACACC CCGTGTGCGG CCCGAGGACT GCAGCATTGG 3840 

35 GCTCCTGCCC CGGAACCATG ATAAGAATCG AAGTATGGAC GTGCTGCCTC TGGACCGCTG 3900 
CCTGCCCTTC CTTATCTCAG TGGACGGAGA ATCCAGCAAT TACATCAACG CAGCACTGAT 3960 
GGATAGCCAC AAGCAGCCTG CCGCCTTCQT GGTCACCCAG CACCCTCTAC CCAACACCGT 4020 
GGCAGACTTC TGGAGGCTGG TGTTCGATTA CAACTGCTCC TCTGTGGTGA TGCTG AATGA 4080 
GATGGACACT GCCCAGTTCT GTATGCAGTA CTGGCCTGAG AAGACCTCCG GGTGCTATGG 4140 

40 GCCCATCCAG DTGGAGTTCG TCTCCGCAGA CATCGACGAG GACATCATCC ACAGAATATT 4200 
CCGCATCTGT AACATGGCCC GGCCACAGGA TGGTTATCGT ATAGTCCAGC ACCTCCAGTA 4260 
CATTGGCTGG CCTGCCTAGC GGGACACGCC CCCCTCCAAG CGCTCTCTGC TCAAAGTGGT 4320 
CCGACGACTG GAGAAGTGGC AGGAGCAGTA TG ACGGGAGG GAGGGACGTA CTGTGGTCCA 4380 
CTGCCTAAAT GGGGGAGGCC GTAGTGGAAC CTTCTGTGCC ATCTGCAGTG TGTGTGAGAT 4440 

45 GATCCAGCAG CAAAACATCA TTGACGTGTT CCACATCGTO A A A AC AC TG C GTAACAACAA 4500 
ATCCAACATG GTGGAGACCC TGGAACAGTA TAAATTTGTA TACGAGGTGU CACTI iG A ATA 4550 
TTTAAGCTCC TTTTAGCTCA ATGGGATGGG GAACTGCCGG AGTCCAGAGG CTGCTGTGAC 4620 
CAAGCCCCCT TTTGTGTGAA TGGCAGTAAC TGGGCTCAGG AGCTCTGAGG TGGCACCCTG 4680 
CCTGACTCCA AGGAGAAGAC TGGTGGCCCT GTGTTCCACG GGGGGCTCTG CACCTTCTGA 4740 

50 GGGGTCTCCT GTTGCCGTGG GAGATGCTGC TCCAAAAGGC CCAGGCTTCC TTTTCAACCT 4800 
AACCAGCCAC AGCCAAGGGC CCAAGCAGAA GTACACCCAC AAGCAAGGCC TTGGATTTCT 4860 
GGCTCCCAGA CCACCTGCTT TTGTTCTGAG TTTGTGGATC TCTTGGCAAG CCAACTGTGC 4920 
AGGTGCTGGG GAGTGGGAGG CTCCCCTGCG CTCCTTCTCC TTAGGAGTGG AGGAGATGTG 4980 
TGTTCTGCTC CTCTACGTCA TGGAAAAGAT TGAGGCTCTT GGGGGTCACT GCTCTGCTGC 504O 

55 CCCCTGCAAC CTCCTTCAGG GGCCTCTGGC ACCAGACATT TGCAGTCTGO ACCAGTGTGA 5100 
CCTTACGATG TTCCCTAGGC CACAAGAGAG GCCCCCCATC CTCACACCTA ACCTGCATGG SI 60 
GGCTTCGCCC ACAACCATTC TGTACCCCTTCCCCAGCCTG GGCCTTGACC GT CCAGCA TT 5220 
CACTGGCCGG CCAGCTGTGT CCACAGCAGT TTTTGATAA A GGTGTTCTTT GCTTTTTTGT J280 
GTGGTCAGTG GGAGGGGGTG GAACTGCAGG GAACTTCTCT GCTCCTCCTT GTCTTTGTAA 5340 

60 AAAGGGACCA CCTCCCTGGG GCAGGGCTTG GGCTGACCTG TAGGATGTAA CCCCTGTGTT 5400 
TCTTTGGTGG TAGCTTTCTT TGGAAGAGAC AAACAAGATA AGATTTG ATT ATTTTCCAAA 5460 
GTGTATGTGA AAAGAAACTT TCTTTTGGAG GGTGTAAAAT CTTAGTCTCT TATGTCAAAA 5520 
AGAAGGGGGC GGGGGAGTTT GAGTATGTAC CTCTA AGACA AATCTCTCGG GCCTTTTATT 5580 
TTTTCCTGGC AATGTCCTTA AAAGCTCCCA CCCTGGGACA GCATGCCACT GAGCAAGGAG 5640 

65 AGATGGGTGA GCCTGAAGAT GGTCCCTTTG GTTTCTGGGG CAAATAGAGC ACCAGCTTTG 5700 
TGCATAATTT GGATGTCCAA ATTTGAACTC CTTCCTAAAG AAACCCAGCA GCCACCTTGA 5760 
AAAAGGCCAT TGTGGAGCCC ATTATACTTT GATTTAAAAT AGGCCAAGAG AATCAGGCCT 5820 
GGAGATCTAG GGTCTTGTCC AAAGTGTGAG TGAGTCAATG AGAGGGAACC AACATTTGCT 5880 
AAGTCTCTAC TGTATGCCAG GGATCATGGT TGGCACTTTC CATAGGACAT TTCACACAGT 5940 

7 0 CCTTAG AACC CCCAGG AG AG AGCTACTG AC TTGTTATCAT CTCCATTTG A TCATCTCCTC 6000 
CAATGAGGAA ACCCACGCAC CTTCCTTAGT AATGAAATCC TGGGTTCCAA AGGGGCAGGT 6060 
AATGGCAATG AGACTTCTCC GTGCTGTTTT CTTCATCTTC TCTAAGCCAA GCAATTATTT 6120 
TATGGAGGGA AAATAAGGCC AGAAACTTCT GAGCAGATAA CTCCACAAA T GGA AATTTAG 61 80 
TACTTTCTTC CTGATGCCAG TTCTTCTGGG AAGCGCAGAA TTTCAGATAT ATTTTAGTAA 6240 

75 CACATTCCCA GCTCCCCAGG AAAGCCAGTC TCATCTAATT TCTTAGTCAG TAAAAACAAT 6300 
TCCCTGTTCC TTCAGGCTAT GAATGGACCA GCCAGGGAAA CTCTCGACCT TGATCTCTAG 6360 
CCAGTGCTTA GGCCCAATAT CTGACAGCCT CAGGTGQGCT GGGACCTAGG AAGCTCCATC 6420 
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TTGAAGGCTG GTCTAGCCCC AGACAGGGCA TGAGGGGCAG AGAATTCAAG AAGGTACAGC 6480 
TTTGGCCCTC AAGAGCCCAC TGTATGCTGG GQAAATGGAA CCATGGTGCA GTAGTGTGGA 6S40 
GTGGATGAGT GTTCCATGAG CCTAGGAGCA AGAAAGTCTC TTCGGCCTCG GGCTTCCTGG 6600 
AGAAGGGGAC GTCCATTCCT GCTGGGTCTT AACAAGCATA AAAAGGAAAA AAAGGAAACT 6660 
5 CAGGCAAAGG GATCCATATG TGCAATGGCA AAGAAATGTG AAAAGGCATT GGGAHAAGCA 6720 
GTCTGGGGGA GGCCAGCCCA GTGCGGGCAC AGCACAACAC GGGOAOCAGC AAGAGATGAG 6780 
CC AGGGTCCA GGAGACAGAT GCCCATCGCG AGTACAGACr TTGTCCTATF GGCAACA AGG 6840 
AGTCCATGGA GCTTTAGAGA GATGCACTCA GCTTCGTGTT GGCCAAGACT CCTTCTGGGC 6900 
CAATGGGGCT GCCTCTTTTC CTTTCATCAG ACACTGTGA A AACATTCCCT TAAGCGTGC A 6960 

1 0 CTTTTTAATA TCACATCTAT TTGTCTGTCT GCTC ATTGTT TraTTGCTGG AACTAAATAT 7020 

GCAATGG ATG ATGAGACl'CA GATTCTATGA G AAACCCAGG GTCTCTGCT !' TACCACGG AC; 7080 
CAGGGTCACC AACCCAGATC TCCAGGCCCA TGAGGATGGA ACATGAAAGG AGCCGACAAA 7140 
AGTTGCTTCC ATTGGCATGG GCTCTGGAGC TGTCCAGAAG TCCAGGGACA CCAGACTTGA 7200 
TCAAGGAAGG GCTGTCACTT TAGAGGTTCA AAAGGAAGTG CCTCAAAGCA AAGGCAAGCA 7260 

1 5 AAGGAACCCC ACGATGAACT TGCTCTTTTC CTTTGATGAG CCTCTCCCCA GGTGTATTTC 7320 
AGCAGACCCC GGGGACCCAC CCCCACTGGG CCTGCTGGCC TCCCTCGGCT CCAGCCCAAT 73B0 
GCCCCAGCTG GCCTTCCCCA GCCTGCAAGG AGCCTGTAGC ATGGCAAATC TGCCTGCTGT 7440 
ATGCTATTTT CTTAGATCTT GGTACATCCA GACAGGATGA GGGTGGAGGQ AGAGCTATTT 7500 
AACACAAATC CTAAGATTTT TTTCTGCTCA GGAAGGGGTG AAATAGCTGG CAGATACAAA 7560 

20 AGACAGTGGC TTTTATCATT TTAAATGGTA GGAATTTAAG GTGTGACTTC AGGGAGAAAC 7620 
AAACl'TGCAA AAAAAAAAAA TCTCAGGCCA TGTTGGGGTA ACCCAGCAAG GGCCAGTGAT 7680 
GATTTCCCCC AGCTCATCCC CTTATTTTCC CACAACCCAA CCATTCTCTA AAGCAGGACA 7740 
GTGAATAGGT CTTAGGCGAG TGCACACAGG A AG AAATTGA GGCTTATGGA TGGGGATG AC 7800 
TTCCCTAAGA TCCCATGGGA CAAGGATGTG GCAAGGCTTG GATGAGATGG GGCACCAGTG 7860 

25 CCCAGGAATT TGAACATITT CCTTTACCCA GGAAATCTCC GGAGCCAACA CCACCACCCC 7920 
CAGGGGGTCT CCCCACCCCA CCCCATTTAC AGGGTGAGCT CAGCCTGTCA TGAGCAGAGG 7980 
AAAATATTAT TAATGCTCTC TGAGTCTTTA CAACAGGAGC TCTTACCTCA TAGATGTGGG 8040 
CTCimTfGG GGAAGATGCA AGGAAGTAAT GAGAAGCOCA GGAAATTTCT CCACCTGTGT. 8100 
TfATGGCCTA AATAGCTTCA GO ATGTATCT TAGCTGCACT CCAACATI'GC A'lWTtTCTO SIW 

3 0 GGGTG AAGAA TCTGGGCCA A CCAGGGGTCC TTGGGCCTCT AG AAGGCCAC AGTAGGCCTC 8220 
TCTTTGTGGG AATGGAAGGG GACAGTTTGC TTTTAGTGCT GGCCCTCTCT GTGGGTGTGG S2S0 
CCTGCAAAGG AACCAACAGA CCCTATGCTG GGGACTCTAA CATGTGAGCT CATTAAATTC 8340 
TTCCAGCATT CTAAAGGAGG GTTTGTGATT GTCACCATTT ACTGATGAGG AAACTAAGGC S4O0 
TCCTAGGGGA GAAATCACTT GCCCACAGTT CCACAGCTAG TGAGTGAATG AACCAGGATT 8460 

3 5 TAAACCGGTT TTTTCTCACT ACAGAGACA A TATTTTTCCA CCATTGTATC TCACATTTTT 8520 

CCCAGOAGGT TACCCATAAC AGAAGAGACT AGAGTGGAAC AGATACGTCA GTGGATAAAG 8580 
CTCAAAGCAA ACAACAGTAA GCTTAAAATT CCTTCATAGT CTCATGTTTT ACGTTCACAA 8640 
TTCATGCAAA ATTTGCATTC CACTTTCTGA TTTAGCCTTG TTGGTTTTAA TATGACTCTA 8700 
TGAATATTTC AAAAAAAAAT GTGCTCTGTT CCTCATGTTG TTCTGTTCTG TTCACCCCGC 8760 

40 TATGACGGAC CCTAGGTCAG CTGGTCTTCA GCTTGACCCT AGAATTGACT CTAGGAGCAG 8820 
TGACCCTGCT GCCTCCCAGA GCCAGTTATA GGCTCAAGAT CAAGACCAAC TGACCTTCTC 8880 
CTAGGCAGCT CCTTTGGTGT GTGGGTGCTC TGACCTCACT GTTCATOAGG GGACCTCAAC 8940 
TAAGGCATCT TCCAGTTGGQ TGCTGGAAGG AACCCATTAA CTCACACTAG AATGATGAGG 9030 
ATTTGCTCAT CTGGCGTGGA GAAGGATGAG CCCACAAAAC CCTAAAGGGA AAAGAGAAGC 9060 

45 FGG .( * oC IGTACTCAGC AGATTCCTGA ATGCTAGGCr GGAAAGTGGT run .IT I- >!:r 
CAAGTGGAGT CACATGGTTG CTAATGTGGG CAAGTCTGAG GACACACTTC ATGAGCAGCT 9180 
GGGGTCTGGA AGGCTCCTCA CTTTACCCTA GCCACACATA ATTACTGGGT GCCTACAGCA 9240 
CCTAGCACCT TGGAGGGGGC ACTATTAGGA AATCGAGATT ACTATGGCAC AATTAATTCC 3300 
TGGGTAAGGC ATGGGGTTGT GGTGGACAGA GCTCAGTCTT TAGTTTGAAC GAAAACATAC 9360 

50 ATACATGAAA AACATACATG AAAAAAGGAC CCTCATCAAC ATTAGAAGGG GTAGATTTGG 9420 
AGCACTTTAG GCAGGAAAAC AGGAACGCAA GGCCAGGAAA CTGGAACCCA GTGAATACTC 9480 
AGAACCGAGG ATGCAGATGA CTTATTTAGC AAAATGGTCA CTTCTGTGAC ATAGCTGGAG 9540 
AAAGGATGGG TAACAGCTTG CCAGAGCCAC TTGGAACAAG GGCAAATCTC AGTGTCTGGG 9600 
GCAAAAGATG ATGCATTTCC CTCTGACCCA TCATGTTTAT TCATCCTCCA CTCCCCATTG 9660 

55 CCACACTAGC TCTTGCTGTA AGTCCTCACC AGGATCTACA TTTCCTCGTC GCTGGTGGGA 9723 
ACCCCTTAGA GTACATAGAG GTATCAGTCC AGTAAGACTG CTCTACACAA CAGAAGTGAG 9780 
GCCCAGGGAG TAGCAGCCAG GCCCTTATCCTGTTACCTCT GCAGGAGTGA CTGCCCAACC 9840 
CAGATCCAGA GACATTGAAG GAAATGATAA TTCCTTGGTA CCTCACTGCC TTGGGACAAA 9900 
ATGAAGAAAG GCACCCTTCC TTAGGCTGCA GCTTGCCACT CCTGGGCTGG GTAAACAGGT 9960 

60 CATCAGCACC AGGCTCAACC AGGAGTAACA TTCTGGAAGA CATGGGTGAG CCCAAGAGGA 10020 
AGCATGAACA GGACGCTGTT CCTAAGTCAT GTCAACAGGT TGTGCTGGGC CAGGATCCCC 10080 
AGGGAAAAAA ATGGTCAACC CAACTGGAGG GTAGG 1TAGA AGAAAAAAAA CATAAACGTG 10140 
GATAGTCATG TCATCTCAAA TCCCTGACTT GGCTTCCCCA TTAC'ITGACA GTCT6AGCTC 10200 
CTTCTTAGCC TGTGACCAGC TTCAAATCAC AGCCAAGTAA AACAAGGAAA TAGGAAAAGT 10260 

65 AAATCCAACT AGAAGAGACA AGCTGAGATT CAGATTTGTT TACTCCTCCC ATGCAAAGTT 1 0320 
TCCCTGTTGG AGGTTTTCCA TGTATACATG TCTAGAAGTG ATAGAATGCA AGGCCTTGGC 10380 
TTTGTCTTGC AGGGATCTGC CTTTGAGGTC ATAGACTGAA CAGCAGGGAG AGAGGTTAGT 10440 
GGTGGAGTGT GGGGGQAGCT GTTCTAGCTC CAGTTTCTTC TGACACATTT TTCAGGATCA 105O0 
TGGATCTGAT CCTCCGAAGC ACAGCAGAGA TATCTAAGCC ATATTTGTGC ACATGAGCAG 10560 

70 ACTCTTCTAG TTTTTTAGTA ACCAGGGATG GGCTTTTGCA TGGCACTGAC TATAGAGATG 10620 
TCTTGTAGAG ATCAAGCCAG TCTTTTGCAT CCCACCTGCC CACCTCCAGA AGAGATGGGA 10680 
AAAGGTCATC AAAGGGCATT CACCAACTGA AATCCACTCA TGAATGTTAG GTCTCTAAAA 10740 
GGAGGCATCA ACACTCACAA TGGTAGCCTC CAAACCTAGC ATCCCACCTA TCTAAGAGCT 10800 
CAGGGGTGGT CCACTGGGGC AGATACAAGG GAAGTGCAAG GGCTCAGGAT GAAAGAAAAT 10860 

75 CTATTGGGAA GAGTTTTAGG GGCTTGATCA TTATGGGGCT TCCTTCTATA TCTGAGAACT 1 0920 
GCTCTGGGTG GTGAGATGTG GACTCTGATC CTTAATTGGA ATGTTCGGAG AATGAGTGTC 10980 
TGGTGGCCTT GAAGTGTTGG ACAGAAAAGT ATCAGTATAA AAGCCTGGAG CTCAGGGTAA 11040 
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TTAATGTAGT TCATGGTTCC TTAGTG AGCA GG ACTCTTGG ATGTGG AGG A GAA AGGGTCA 1 1 100 
TAGGAAGTAA ACCACCAAAA TTACAAAATT G AGTCTCTGT ACAATTACTT C AGTGCCTTT 11160 
GGGCTTATGA ATACAAATCA GTGGGCCTTC TCTATG ATGG TCCAACAAAC TCTCAGTGTC 1 1220 
CACCCTGTCC CTGTATCTCC CATGGAAGATGAATAATGTC AGGTGTTCTT TGGGTCAAAG 1I2S0 
5 GCCCCAGGGC AGTCTGGAGG CTTAG AGGGC AG AGTGGTGT CATTCCATGT AAAGTTAGGC U3<10 
TTCTGAGGGG TCAGGCAGAA TATGGTGTCC ATATCTTCCA TAGCTCTGCA GATTCTTGGA 11400 
TGAAGTCAAG CACAGTTTGC TAGACCCAGG TCACTCCTCT GAGTATAACT AGGACCCATG 1 1460 
AGTGAAACTT AATAGCTGTA AGGAAGAACC TGCTGTCTGC CAGAGAGGAT A AGCTGCCCA 11520 
TCTCAGCAGC TGTCTAAAAG AAGGCAGGTG TCTCTTTAAA GGGAAGAGAA GCATTGGTGA 1 1580 

1 0 AATGGATTTC AGGTCACTTC CATTCCAGAT GGGTGAGATC TTGTGGAGCT GGGATCATGT 1 1640 
TTG AACTCAT TCATACCTGT AGAGCACG A A TCCAAGTAG A TTGTC3TTTGG TCTGTACAGG 11700 
CTGAAGCCCC CTGCTCTCCC ACCCAAGTGC CCCCACTGAG CAGGCCAACA TGCTGTTGTG 1 1760 
GCCACATATA CTGGGCTGAT CCAGGCTGGT TATCACCAAA CAGCAAACCA TAGGGAACAG 1 1820 
CTGCTTTGCC AT AG ACCCA A TACCCATGTA GATCTCTCAT G AGAGCAGCC ATAACTCAG A U 880 

1 5 CCCACTGACC AACAGGGCCA TGAGTGACAG CCAOAACCAG TGAAGGTCCA AGTAGGACAC 1 1 940 
AGAGCAGGGC TTTTCTTAOC ATACACATTA TCTCCAGAGG TTATTTCTAC CCCACTCCCT 12000 
ATTCAAGGCC TGTTGGAGCA CACTGCAAAA GCAAAAGCAC AGTAACTCAA TTTACACATG 12060 
ATTATAATCA TTTCCAGTGC ACACATTTCA TCACCAGGTG GATCCTGAGC TAGCCCATGT 12120 
AAATCCGGGT TAACCCATAT TGGTAATCAT ACTCAAAAGC ACTTTTCACC CTACATTCTA i J1S0 

20 CTAGCCAATC AAAG ACAAAG AGTTGTGGCC TCTACCATTG CCTTGGCTTC TGGACACCCT 1 2240 
CACAAGCTAT CCCAAGGTTC CCGCTCAACT CCAGGGAGGC TQACATCTTC ACATCCACTG 12300 
GGCATATAAT ATTGCATGAG ACCAAAGTCT CCACACTCTT TGCAGCCTCC TCCATOAATC 12360 
CCAATGGCCT GCACTTGTAC AGTTTGGGTG TTTGATAGAT AAAGCACGTA TGAGAAGAGA 12420 
AAACAAAATA AATCAACTTT TTAAAAAAGC CAGCACTGTG CTGTCAATGT TTTTTTTTTC 12480 

25 TTTTCAATTC TAGCTCAGAA AAGCAGAAGG TAAATAATGT CAGGTCAATG AATATCAGAT 12540 
ATATTTTTTG ACTGTACATT ACAGTGAAGT GTAATCTTTT TACACCTGCA AGTCCATCTT 12600 
ATTTATTCTT GTAAATGTTC CCTGACAATG TTTGTAATAT GGCTGTGTTA AAAAATCTAT 12660 
ACAATAAAGC TGTGACCGTG 

30 Seq ID NO: 98 Protein sequence: 

Protein Accession #: NP00S98 1. 1 

1 11 21 31 41 51 

3 5 GIREFNPPRO TEtNTYROSl NEPHOSPHAT ASERECEPTO RTYPETHOMO SAPIENSMAS 60 

LAALALSLLL RLQLPPLPGA RAQSAPGGCS FDEHYSNCGY SVALGTNGFT WEQINTTEKP 120 
MLDQAVPTGS FMMVNSSGRA SGQKAHLLLP TLKENDTHC1 DFHYYFSSRD RSSPGALNVY 1 80 
VKVNGGPQGN PVWNVSG WT EGWVKAELAI STFWPHFYQV EFESVSOCGH PGYIAVDEVR 24D 
VLAHPCRKAP HFLRLQNVEV NVGQNATFQC IAGGKWSQHD KLWLQQWNGR DTALMVTRV V 300 

40 NHRRFSATVS VADTAQRSVS KYRCVIRSDG GSGVSNYAEL IVKEPPTPIA PPELLAVGAT 360 
YLWIKPNANS IIGDGPHLK EVEYRTTTGT WAETHIVDSP NYKLWHLDPD VEYEIRVLLT 420 
RPGEGGTGPP GAPLTTRTKC ADPVHGPQNV El VD1RARQL TLQWEPFGYA VTRCHSYNLT 480 
VQYQYVFNQQ QYEAEEVIQT SSHYTLRGLR PFMTIRLRLL LSNFEGRMES EELWQTEED 540 
' =i \ Pl.SSK 3C3PH KT 1 WKPPNETNG I.Li TTT-,-, I WG I.D1 JADLSSQRGKVF COO 

45 KLRN13T1-1HLF VGLYPGTTYS FTKASTAKG FGPPVTTRI/ 1 i APSMFI YDTDTPLNET 660 
DTTl'l'VMLfti' AQSRGAI Q I c 1 < >i SRI He 1 "I SL IV " 

FAAELKPANL PVTQPFTVGD NKTYNGYWNP PLSPLKSYSI YFQALSKANG ETKINCVRLA 780 
TKAPMGSAQ V TPGTPLCLLT TGASTQNSMT VEPEKQVDNT VKMAGVIAGL LMFHILLGV 840 
MLT1KRRRNA YSYSYYLSQR KLAKKQKETQ SGAQREMGPV ASADKPTTKL SASRNDEGFS 900 

50 SSSQDVNGFT DGSRGELSQP TLTiQTKPYR TCDPVEMSYP RDQFQLAIRV ADLLQH1TQM 960 

KRGQG YGFKE EYEALPEGQT ASWDTAKEDE NRNKNRYGNI ISYDHSRVRL LVLDGDPHSD 1 020 
YJNANYIDGY HRPRHYIATQ GPMQETVKDF WRMTW QENS A SrVMVTNLVE VGRVKCVRYW 1080 
PDDTEVYGDI KVTUETEPL AEYVIRTFTV QKKGYKEIRE LRLFHFTSWP DHGVPCYATG 1 140 
LLGFVRQVKF LNPPEAGPIV VHGSAGAGRT GCFIAIDTML DMAENEGWD IFNCVRELRA 1200 

55 QRVNLVQTEE QY VFVHDAIL EACLCGNTAI PVCEFRSLYY NISRLDPQTK SSQ1KDEFQT 1260 
LN1 VTPRVRP EDCSIGLLPR NHDKNRSMDV LPLDRCLPFL ISVDGESSNY INAALMDSHK 1320 
QP AAFVVTQH FLPNTV ADFW RLVFDYNCSS WMLNEMDTA QFCMQYWPEK TSGCYGPIQV 1380 
EFVSADIDED IIHRIFRICN MARPQDGYR1 VQHLQYIGWP AYRDTPFSKR SLUrCVVRRLE 1440 
KWQEQYDGRE GRTVVHCLNG GGRSGTFCAI CSVCEMIQQQ MUDVFPflVK TLRMNKSNMV 1 500 

60 ETLEQYKFVY EVALEYLSSF 

Seq ID NO: 99 DNA sequence 

Nucleic Acid Accession #: NM_002988,1 

Coding sequence: 71-340 (underlined sequences correspond (o start and stop codons) 

65 

I 11 21 31 41 51 

CCGGCACGAG AGG AGTTGTG AGTTTCCAAG CCCCAGCTCA CTCTG ACCAC TTCTCTGCCT 60 
GCCCAGCATC ATGAAGGGCC TTGCAGCTGC CCTCCTTGTC CTCGTCTGCA CCATGGCCCT 1 20 

70 CTGCTCCTGT GCACAAGTTG GTACCAACAA AGAGCTCTGC TGCCTCGTCT ATACCTCCTG 180 
GCAGATTCCA CAAAAGTTCA TAGTTGACTA TTCTGAAACC AGCCCCCAGT GCCCCAAGCC 240 
AGGTGTCATC CTCCTAACCA AGAGAGGCCG GCAGATCTGT GCTG ACCCCA ATAAGAAGTG 300 
GGTCCAGAAA TACATCAGCG ACCTGAAGCT GAATGCCTGA GGGGCCTGGA AGCTGCGAGG 360 
GCCCAGTGAA CTTGGTGGGC CCAGGAGGGA ACAGGAGCCT GAGCCAGGGC AATGGCCCTG 420 

7 5 CCACCCTGG A GGCCACCTCT TCTAAG AGTC CCATCTGCTA TGCCC AGCCA CATTAACTA A 4«0 
CTTTA ATCTT AGTTTATGCA TCATATTTCA TTTTG AA ATT GATTTCTATT GTTGAGCTGC 540 
ATTATGAAAT TAGTATTTTC TCTGACATCT CATGACATTG TCTTTATCAT CCTTTCCCCT 600 
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TTCCCTTCAA CTCTTCGTAC ATTCAATGCA TGGATCAATC AGTGTGATTA GCTTTCTCAG 660 
CAGACATTGT GCCATATGTA TCAAATGACA AATCTTTATT GAATGGTTTT GCTCAGCACC 720 
ACCTTTTAAT ATATTGGCAG TACTTATTAT ATAAAAGGTA AACCAGCATT CTCACTGTGA 780 
AAAAAAAAAA AAAAAAAAAA AAA 

Seq ID NO; 1 00 Protein sequence: 
Protein Accession ft NPJ0O2979. 1 

1 II 21 31 41 51 
I 1 I I I I 

MKGLAAA1XV LVCTMALCSC AQVGTNKELC CLVYTSWQIP QKFIVDYSET SPQCPKPGVI 60 
LLTKRGRQIC ADPNKKWVQK YISDLKI-NA 

Seq ID NO: 101 DNA sequence 

Nucleic Acid Accessions NM 015507.2 

Coding sequence: 241-1902 (underlined sequences correspond to start and stop CDdons) 

1 11 21 31 41 51 
11)111 

CCGCAGAGGA GCCTCGGCCA GGCTAGCCAG GGCGCCCCCA GCCCCTCCCC AGGCCGCGAG 60 
CGCCCCTGCC GCGGTGCCTG GCCTCCCCTC CCAGACTGCA GGGACAGCAC CCGGTAACTG 120 
CGAGTGGAGC GGAGGACCCG AGCGGCTGAG GAGAGAGGAG GCGGCGGCTT AGCTGCTACG I SO 
GGGTCGGGCC GGCGCCCTCC CGAGGGGGGC TCAGGAGG AG GAAGG AGGAC CCGTGCGAGA 240 
ATGCCTCTGC CCTGGAGCCT TGCGCTCCCG CTGCTGCTCT CCTCGGTGGC AGGTGGTTTC 300 
GGGAACGCGG CCAGTGCAAG GCATCACGGG TTGTTAGCAT CGGCACGTCA GCCTGGGGTC 360 
TGTCACTATG GAACTAAACT GGCCTGCTGC TACGGCTGGA GAAGAAACAG CAAGGGAGTC 420 
TGTGAAGCTA CATGCGAACC TGGATGTAAG TTTGGTGAGT GCGTGGGACC AAACAAATGC 460 
AGATGCTTTC CAGGATACAC CGGGAAAACC TGCAGTCAAG ATGTGAATGA GTGTGGAATG S4n 
AAACCCCGGC CATGCCAACA CAGATGTGTG AATACACACG GAAGCTACAA GTGCTnTGC 600 
CTCAGTGGCC ACATGCTCAT GCCAGATGCT ACGTGTGTGA ACTCTAGGAC ATGTGCCATG 660 
ATAAACTGTC AGTACAGCTG TGAAGACACA GAAGAAGGGC CACAGTGCCT GTGTCCATCC 720 
TCAGGACTCC GCCTGGCCCC AAATGGAAGA GACTGTCTAG ATATTGATGA ATGTGCCTCT 780 
GGTAAAGTCA TCTGTCCCTA CAATCGAAGA TGTGTGAACA CATTTGGAAG CTACTACTGC 840 
AAATGTCACA TTGGTTTCGA ACTGCAATAT ATCAGTGGAC GATATGACTG TATAGATATA 900 
AATGAATGTA CTATGGATAG CCATACGTGC AGCCACCATG CCAATTGCTT CAATACCCAA 96D 
GGGTCCTTCA AGTGTAAATG CAAGCAGGGA TATAAAGGCA ATGGACTTCG GTGTTCTGCT 1020 
ATCCCTGAAA ATTCTGTGAA GGAAGTCCTC AGAGCACCTG GTACCATCAA AGACAGAATC 10E0 
AAGAAGTTGC TTGCTCACAA AAACAGCATG AAAAAGAAGG CAAAAATTAA AAATGTTACC 1 140 
CCAGAACCCA CCAGGACTCC TACCCCTAAG GTGAACTTGC AGCCCTTCAA CTATGAAGAG 1200 
ATAGTTTCCA GAGGCGGGAA CTCTCATGGA GGTAAAAAAG GGAATGAAGA GAAAATGAAA 1260 
GAGGGGCTTG AGGATGAGAA AAGAGAAGAG AAAGCCCTGA AGAATGACAT AGAGGAGCGA 1320 
AGCCTGCGAG GAGATGTGTT TTTCCCTAAG GTGAATGAAG CAGGTGAATT CGGCCTGATT 1380 
CTGGTCCAAA GGAAAGCGCT AACTTCCAAA CTGGAACATA AAGATTTAAA TATCTCGGTT 1440 
GACTGCAGCT TCAATCATGG GATCTGTGAC TGGAAACAGG ATAGAGAAGA TGATTTTGAC 1500 
TGGAATCCTG CTGATCGAGA TAATGCTATT GGCTTCTATA TGGCAG'ITCC UuCCITGGCA 1 560 
GGTCACAAGA AAGACA1TGG CCOATTGAAA CITCTOCTAC CTGACCTGCA ACCCCAAAGC 1C2Q 
AACTTCTGTT TGCTCTTTGA TTACCGGCTG GCCGGAG ACA AAGTCGGGAA ACTTCGAGTG 1 680 
TTTGTGA A AA ACAGTAACAA TGCCCTGGCA TGGGAGAAGA CCACGAGTG A GGATGAAAAG ! 740 
TGGAAGACAG GGAAAATTCA GTTGTATCAA GGAACTGATG CTACCAAAAG CATCATTTTT 1800 
GAAGCAGAAC GTGGCAAGGG CAAAACCGGC GA A ATCGC AG TGGATGGCGT CTTGCTTGTT 1860 
TCAGGCTTAT GTCCAGATAG CCTTTTATCT GTGGATGACTGAATGTTACT ATCTTTATAT 1920 
TTGACTTTGT ATGTCAGTTC CCTGGTTTTT TTGATATTGC ATCATAGGAC CTCTGGCATT 19S0 
TTAGAATTAC TAGCTGAAAA ATTGTAATGT ACCAACAGAA ATATTATTGT AAGATGCCTT 2040 
TCTTGTATAA GATATGCCAA TATTTGCTTT AAATATCATA TCACTGTATC TTCTCAGTCA 2100 
TTTCTGAATC TTTCCACATT ATATTATAAA ATATGGAAAT GTCAOTTTAT CTCCCCTCCT 2160 
CAOTATATCT GATTTGTATA AGTAAGTTG A TG AGCTTCTC TCTACAACAT TTCTAGAAAA 2220 
TAGAAAAAAA AGCACAGAGA AATGTTTAAC TGTTTGACTC TTATGATACT TCTTGGAAAC 2280 
TATGACATCA AAGATAGACT TTTGCCTAAG TGGCTTAGCT GGGTCTTTCA TAGCCAAACT 2340 
TGTATATTTA AATTCTTTGT AATAATAATA TCCAAATCAT CAAAAAAAAA AAAAAAAA 

Seq ID NO: 102 Protein sequence: 
Protein Accession //: NP_05S322.2 

1 11 21 31 41 51 

1 I I 1 I 1 

MPLPWSLALP LLLSVWAOGF GNAASARHHG LLASARQPGV CHYGTKLACC YGWRRNSKGV 60 
CEATCEPGCK FGECVGPNKC RCFPGYTGKT CSQDVNECGM KPKPCQHRCV NTHGSYKCFC 120 
LSGHMLMPDA TCVNSRTCAM INCQYSCEDT EEGFQCLCPS SGLRLAPNGR DCLDIDECAS 1 80 
GKVICPYNRR CVNTFGSYYC KCfflGFELQY 1SGRYDCIDI NECTMDSHTC SHHANCFNTQ 240 
GSFKCKCKQG YKGNGLRCSA IPENSVKEVL RAPGTIKDR] KKLLAHKNSM KKKAKIKNVT 300 
PEPTRTPTPK VNLQPFNYEE IVSRGGNSHG GKKGNEEKMK EGLEDEKREE KALKNDIEER 360 
SLRGDVFFPK VNEAGEFGLt LVQRKALTSK LEHKDLNISV DCSFNHGICD WKQDREDDFD 420 
WNFADRDNAI GFYMAVPALA GHKKDIGRLK L1XPDLQPQS NFCLLFDYRL AGDKVGKLRV 4S0 
FVKNSNNALA WEKTTSEDEK WKTGKIQLYQ GTDATKSIIF EAERGKGKTG EIAVDGVIXV 540 
SGLCPDSLLS VDD 

Seq ID NO: 103 DNA sequence 
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Nucleic Acid Accession #: NMJ>01565.1 

Ceding sequence: 67-363 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

GAGACATTCC TCAATTGCTT AGACATATTC TGAGCCTACA GCAGAGGAAC CTCCAGTCTC 60 
AGCACCATGA ATCAAACTGC GATTCTGATT TOCTGCCTTA TCTTTCTGAC TCTAAGTGGC 1 20 
ATTCAAGGAG TACCTCTCTC TAGAACCGTA CGCTGTACCT GCATCAGCAT TAGTAATCA A 1 80 
CCTGTTAATC CAAGGTCTTT AGAAAAACTT GAAATTATTC CTGCAAGCCA ATTTTGTCCA 240 
1 0 CGTGTTGAGA TGATTGCTAC AATGAAAAAG AAGGGTGAGA AGAGATGTCT GAATCCAGAA 300 
TCGAAGGCCA TCAAHAATTT ACTGAAAGCA GTTAGCAAQG AAATGTCTAA AAGATCTCCT 360 
TAAAACCAGA GUGGAGCAAA ATCGATGCAG TGCTTCCAAG GATGGACCAC ACAGAGGC 1 G <12U 
CCTCTCCCAT CACTTCCCTA CATGGAGTAT ATGTCAAGCC ATAATTGTTC TTAGTTTGCA 480 

gttacactaa aaggtgacca atgatggtca ccaaatcagc tgctactact cctgtagoaa 540 
1 5 ggttaatgtt catcatccta agctattcag taataactct accctggcac tataatgtaa 600 
gctctactga ggtgctatgt tcttagtgga tgttctgacc ctgcttcaaa tatttccctc 660 
acctttccca tcttccaagg gtactaagga atctttctgc tttggggttt atcagaattc 720 
tcagaatctc aaataactaa aaggtatgca atcaaatctg ctttttaaag aatgctcttt 780 
acttcatgga cttccactqc catcctccca aggggcccaa attctttcag tggctaccta 840 

2 0 CATACAATTC CAAACACATA CAGGAAGGT A OA AATATCTG AAAATGTATG TGTAAGTATT 900 
CTTATTTAAT GAAAGACTGT ACAAAGTATA AGTCTTAGAT GTATATATTT CCTATATTGT 960 
TTTCAGTGTA CATGGAATAA CATGTAATTA AGTACTATGT ATCAATGAGT AACAGGAAAA 1020 
TTTTAAAAAT ACAGATAGAT ATATGCTCTG CATGTTACAT AAGATAAATG TGCTGAATGG 1080 
TTTTCAAATA AAAATGAGGT ACTCTCCTGG AAATATTAAG 

25 

Sea, ID NO: 104 Protein sequence: 
Protein Accession #: NP_00 1 556. 1 

1 11 2) 31 41 51 
30 | | 1 | I I 

MNQTAILICC LIFLTLSGIQ GVPLSRTVRC TCISISNQPV NPRSLEKLEI IPASQFCPRV 60 
EIIATMKfCKG EKRCLNPESK AIKNLLKAVS KEMSKRSP 

Seq ID NO: 105 DNA sequence 
35 Nucleic Acid Accession ft NM.Q15068.J 

Coding sequence: 1 1 70-2243 (underlined sequences correspond to start and stop codons) 

I 11 21 31 41 51 
. n I )■ I t 1 I 

40 GTAACAACCG TCACCCTGGG TCCCGACTGC CCACCTCCTC CTCCTCCCCC TCCCCCCAAC 60 

AACAACAACA ACAACAACTC CAAGCACACC GGCCATAAGA GTGCGTGTGT CCCCAACATG 120 
ACCGAACGAA GAAGGG ACGA GCTCTCTGAA GAGATCAACA ACTTAAG AG A G AAGGTCATG 1 80 
AAGCAGTCGG AGG AGAACAA CAACCTGCAG AGCCAGGTGC AGAAGCTCAC AGAGGAGAAC 240 
ACCACCCTTC GAGAGCAAGT GGAACCCACC CCTGAGGATG AGGATGATGA CATCGAGCTC 300 

4 5 CGCGGTGCTG CAGCAGCTGC TGCCCCACCC CCTCCAATAG AGGAAG AGTG CCCAGAAGAC 360 
CTCCCAtiAGA AGTTCGATGG CAACCCAGAC ATGCTGGCTC CTTTCATGGC CCAGTGCCAG 420 
ATCTTCATGG AAAAGAGCAC CAGGGATTTC TCAGTTGATC GTGTCCGTGT CTGCTTCGTG 480 
ACAAGCATGA TGACCGGCCG TGCTGCCCGT TGGGCCTCAG CAAAGCTGGA GCGCTCCCAC 540 
TACCTGATGC ACAACTACCC AGCTTTCATG ATGGAAATGA AGCATGTCTT TGAAGACCCT 600 

50 CAGAGGCGAG AGGTTGCCAA ACGCAAGATC AGACGCCTGC GCCAAGGCAT GGGGTCTGTC 660 
ATCGACTACT CCAATGCTTT CCAGATGATT GCCCAGGACC TGGATTGGAA CGAGCCTGCG 720 
CTGATTGACC AGTACCACGA GGGCCTCAGC GACCACATTC AGGAGGAGCT CTCCCACCTC 780 
GAGGTCGCCA AGTCGCTGTC TGCTCTGATT OGGCAGTGCA TTCACATTGA GAGAAGGCTG 840 
GCCAGGGCTG CTGCAGCTCO CAAGCCACGC TCGCCACCCC GGGCGCTGGT GTTGCCTCAC 900 

55 ATTGCAAGCC ACCACCAGGT AGATCCAACC GAGCCGGTGG GAGGTGCCCG CATGCGCCTG 960 
ACGCAGGAAG AAAAAGAAAG ACGCAGAAAG CTGAACCTGT GCCTCTACTG TGGAACAGGA 1020 
GGTCACTACG CTGACAATTG TCCTGCCAAG GCCTCAAAGT CTTCGCCGGC GGGAAACTCC 1080 
CCGGCCCCGC TGTAGAGGGA CCTTCAGCGA CCGGGCCAGA AATAATAAGG TCCCCACAAG 1140 
ATGATGCCTC ATCTCCACAC TTGCAAGTGATGCTCCAGAT TCATCTTCCG GGCAGACACA 1 200 

60 CCCTGTTCGT CCGAGCCATG ATCGATTCTG GTGCTTCTGG CAACTTCATT GATCACGAAT 1260 
ATUTTGCTCA AAATGGAATT CCTCTAAGAA TCAAGGACTG GCCAATACTT GTGGAAGCAA 1320 
TTGATGGGCG c:CCCATAGCA TCGGGCCCAG TTGTCCACGA AACTCACGAC CTGATAGTTG 13S0 
ACCTGGGAGA TCACCGAGAG GTGCTGTCAT TTGATGTGAC TCAGTCTCCA TTCTTCCCTG 1440 
TCGTCCTAGG GGTTCGCTGG CTGAGCACAC ATGATCCCAA TATCACATGG AGCACTCGAT 1500 

65 CTATCGTCTT TGATTCTGAA TACTGCCGCT ACCACTGCCG GATGTATTCT CCAATACCAC 1S60 
CATCGCTCCC ACCACCAGCA CCACAACCGC CACTCTATTA TCCAGTAGAT GGATACAGAG 1620 
TTTACCAACC AGTGAGGTAT TACTATGTCC AGAATGTGTA CACTCCAGTA GATGAGCACG 1680 
TCTACCCAGA TCACCGCCTG GTTGACCCTC ACATAGAAAT GATACCTGGA GCACACAGTA 1740 
TTCCCAGTGG ACATGTGTAT TCACTGTCCG AACCTGAAAT GGCAGCTCTT CGAGATTTTG 1800 

70 TGGCAAGAAA TGTAAAAGAT GGGCTAATTA CTCCAACGAT TGCACCTAAT GGAGCCCAAG 1 660 
TTCTCCAGGT GAAGAGGGGG TGGAAACTGC AAGTTTCTTA TGATTGCCGA GCTCCAAACA 1920 
ATTTTACTAT CCAGAATCAG TATCCTCGCC TATCTATTCC AAATTTAGAA GACCAAGCAC 1980 
ACCTGGCAAC GTACACTGAA TTCGTACCTC AAATACCTGG ATACCAAACA TACCCCACAT 2040 
ATGCCGCGTA CCCGACCTAC CCAGTAGG AT TCGCCTGGTA CCCAGTGGG A CG AGACGGAC 2 1 00 

75 AAGGAAGATC ACTATATGTA CCTGTOATGA TCACTTGGAA TCCACACTGG TACCGCCAGC 2160 
CTCCGGTACC ACAGTACCCG CCGCCACAGC CGCCGCCTCC ACCACCACCA CCGCCGCCGC 2220 
CTCCATCTTA CAGTACCCTG TAAATACCTG TCATGTCCTT CAGGATCTCT GCCCTCAAAA 2280 
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TTTATTCCTG TTCAGCTTCT CAATCAGTGA CTGTGTGCTA AATTTTAGGC TACTGTATCT 2340 
TCAGGCCACC TGAGGCACAT CCTCTCTGAA ACGGCTATGG AAGGTTAGGG CCACTCTGGA 2400 
CTGGCACACA TCCTAAAGCA CCAAAAGACC TTCAACATTT TCTGAGAGCA ACAGAGTATT 2460 
TGCCAATAAA TGATCTCTCA TTTTTCCACC TTGACTGCCA ATCTAACTAA AATAATTAAT 2520 
5 AAGTTTACTT TCCAGCCAGT CCTGGAAGTC TGGGTTTTAC CTGCCAAA AC CTCCATCACC 2580 
ATCTAAATTA TAGOCTGCCA AATTTGCTGT TTAACATTTA CAGAGAAGCT GATACAAACG 2640 
CAGGAAATGC f GATTTCTTT ATGGAGGGGG AGACGAGGAG GAGGAGGACA TGACTTTTCT 2700 
TGCGGTTTCG GTACCCTCTT TTTAAATCAC TGGAGGACTG AGGCCTTATT AAGGAAGCCA 2760 
AAATTATCGG TGCAGTGTGG AAAGGCTTCC GTGATCCTCT CGCTGCACCC TTAGAAACTT 2S20 

1 0 CACCGTCTTC AAACTCCATT TCCATGGTTC TGTTAATTCT CA AGGAGCAG CAACTCGACT 2880 

GGTTCrrCCCA GGAGCAGGAA AAACCCTTGT G ACATGAAAC ATCTCAGGCC TGAAAAGAAA 2940 

1 ( IGTTA AGACTATGTC CA'l 

ACATGTAGCC AATGACTCCG GCTTTGACAC AACACCTTAC CATCATCATQ CCATGATGGC 3060 
TTCCACAAAG CATTAAACCT GGTAACCAGA GATTACrGGT GGCTCCAGCG TTGTTAGATG 3120 

1 5 TTCATGAA AT GTGACCACCT CTCAATCACC TTTG AGGGCT AAAGAGTAGC ACATCA AAAG 3180 
GACTCCAAAA TCCCATACCC AACTCTTAAG AGATTTGTCC TGGTACTTCA GAAAGAATTT 3240 
TCATG AGTGT TCTTAATTCG CTGGAAAAGC ACCAGCTGAC GTTTTGGAAG AATCTATCCA 330D 
TGTGTCTGCC TCCATATGCA TCTGGGCATT TCATCTTCAG TCCCCTCATT AGACTGTAGC 33 60 
ATTAGGATGT GTGGAGAGAG GAGAAATQAT TTAGCACCCA GATTCACACT CCTATGCCTG 3420 

20 GAAGGGGGAC ATCTTTGAAG AAGAGGAATT AGGGCTGTGG ACACTGTCTT GAGGATGTGG 3480 
ACTTCCTTAG TGAGCTCCAC ATTACTTGAT GGTAACCACT TCAAAAGGAT CAGAATCCAC 3540 
GTAATGAAAA AGGTCCCTCT AGAGGATGG A GCTQATGTGA AGCTGCCAAT GGATGAAAAG 3600 
. CCTCAGAAAG CAACTCAAAG G ACTCAAAGC AACGGACAAC ACAAGAGTTG TCTTCAGCCC 3660 
AGTGACACCT CTGATGTCCC CTGGAAGCTT TGTGCTAACC TGGGACTGCC TGACTTCCTT 3720 

25 TAGCCTGGTC CCTTGCTACT ACCTTGAACT GTTTTATCTA ACCTCTCTTT TTCTGTTTAA 3780 
TTCTTTGCTA CTGCCATTGA CCCTGCTGCA GGATTTGTGT CATTTTCCTG CCTGGTTGCT 3840 
GAGACTCCAT TT1 GCTGCCA CACACAGAGA TGTAAGAGGC AGGCTTTAAT TGOCAAAGCA 3900 
CAG TTTGAGC AGTAGAAAAC AACATGGTGT A7ATCTCAAA TTGCCTGACA TGAAGAGGAG 39S0 
TCTA ACGGTG AAGTTTCACT TTTCATCAGC ATCATCTTTC ACATGTTCAT TATCATCCGC 4020 

30 TCTTATTCTT GCATGTTTAA ACACTT AA AA TTTTTAGTAT A ATTTTTAGT GTGTTTTG AA 4080 
GTGGTGACTA GGCTTTCAAA AACTTCCATT 0 A ATTACAAA GCACTATCCA GTTCTTATTG 4140 
TTAAACTAAG TAAAAATGAT AAGTAACATA GTGTAAAATA TTCCTTTACT GTGAACTTCT 4200 
TACAATGCTG TGAATGAGAG GCTCCTCAGA ACTGGAGCAT TTGTATAATA ATTCATCCTG 4260 
TTCATCTTCA ATTTTAACAT CATATATAAT TTCAATTCTA TCAATTGGGC CTTTAAAAAT 4320 

3 5 CATATAAAAG GATATAAAAT TTGAAAAGAG AAACCTAATT GGCTATTTAA TCCAAAACAA 43S0 
CTTTTTTTTT TCCTTCAATG 0 AATCAGAAA GCTTGTCAAT CACTCATGTG TTTTAGAGTA 4440 
ATTACTTTTA AAATGGTGCA TTTGTGCTTC TGAACTATTT TGAAGAGTCA CTTCTGTTTA 4500 
CCTCA AGTAT CAATTCATCC TCCATACATT TG AATTCAAG TTGTTTTTTG TCAA ATTTAC 4560 
AGTTGTCAAT TGATCTTCAA GCTGCAGGGT GCCTAG AAAT GGGCCGTTGT CTGTAGCCCT 4620 

40 GGCATGTGCA CACGGACATT TGCCACCACT GCAAGCAAAA GTCTGGAGA A GTTCACCAAC 4680 
GACAAGAACG ATTAGGGAAA ATATGCTGCT GTGGGTTAAC AACTCAGAAA GTCCCTGATC, 4740 
C ACATTTGGC TGTTTACTAA AGCTl'GTGAT TAACTTTTTG GCAGTGTGTA CTATGCTCTA 4800 
TTGCTATATA TGCTATCTAT AAATGTAGAT GTTAAGGATA AGTAATTGTA AATTTATTAT 4860 
TCTATAGTTT TG AAGTTTGG TTA AGTTTCC TTTCACTCAA TTGATTTATT TTGTTGTTA A 4920 

45 TCAAATTTAT GTTAATTGGA TCCTTTAAA1 "ITTTTl'TGGC ATTTTCCAAC AAAAATGGCT 49S0 

TTATTCATAA GAA AGGAA AA AAATCAATGG AATTTGATAT CTA AAGAAO T TACiAAAGGGA 5040 
GCAAAATAAA AAACATAAAG GAGATAGATG AATTAGTAAG CAAATCAGTA GTCGAGTTTT 5100 
TCAAACTGGC AAAATTAATT AATTGACTTT TAGCCCAAAT TTACATTGTT AATTAAATCA 5160 
AGAAGGAAGA AGATCTAAGA GCTCCCATTG ATACGCAAGC CTAGAGAGAA CTAGCTAAAT 5220 

5 0 TTATCATGCT AGG ATATTGA AACACAGAAA GTTTACATAC ATTTATGAAG GGTCAATTTA S280 
GTTTGGACAG TGAGGTATTr GTCTTAGTGG AAAAAAGGAG AATTAGTCTO ATCAAATCGT 5340 
GAAGTAATAC AGTGAACTTG CAGGTGCACA AAATAAGAGG GCCACATCTA TATGGTGCAG 5400 
TCTGG AATTC TGTTTAAGTT TGTAGGTACC TCTTGG ACTT CTGAATTGAT CCAGTTGTCA 5460 
TCCACCACAG ACATCTCACA TCAGATACAG ACAGTTCCAA GATTGACAAC AGAGAACAAC 5520 

55 CTGCTGGAAA GACCTGGGCA GAAATGGAGA GCCCTGCGGG AACCATGCTA CATTTTCATG 55S0 
TAAAGAGAGA ATGCACATCT GATGAGAGTG AAAGTTCTTT GTTGTTTTAG ATTGTAGAAT 5640 
GGTATTGAAT TGGTCTGTGG AA AATTGCAT TGCTTTTATT TCTTTGTGTA ATCAAGTTTA 5700 
AGTAATAGGG GATATATAAT CATAAGCATT TTAGGGTGGG AGGGACTATT AAGTAATTTT 5760 
AAGTGGGTGG GGTTATTTAG AATGTTAGAA TAATATTATG TATTAGATAT CGCTATAAGT 5820 

60 GGACATGCGT ACTTACTTGT AACCCTTTAC CCTATAATTG CTATCCTTAA AGATTTCAAA 5880 
T rCGGA GGGAAC1 KJAC rTATTTAGA < U 1 iATAA 1 

AACCCCAGTG GGAGAAAG'IT CAAAGGTGAT TAGATTAATA ATI TAATAGA GGATGAGTGA 60C0 
CCTCTCATAA ATTACTGCTA GAATGAACTT GTCAATGATG GATGGTAAAT TTTCATGGAA 6060 
GTTATAAAAG TGATAAATAA AAACCCTTGC TTTTACCCCT GTCAGTAGCC CTCCTCCTAC 6120 

65 CACTGA ACCC CATTGCCCCT ACCCCTCCTT CTA ACTTTAT TGCTGTATTC TCTTCACTCT 61 80 
ATATTTCTCT CTATTTGCTA ATATTGCATT GCTGTTACAA TAAAAATTCA ATAAAGATTT 6240 
AGTGGTTAAG TGC 

Seg ID NO: 106 Protein senuencc: 
70 protein Accession ft NP 055883.1 

1 11 21 3i 41 51 

MTERRRDELS EeInNLREKV MKQSEE>TNNL QSQVQKLTEE NTTLREQVEP TPEDEDDDIE 60 
75 LRGAAAAAAP PPPIEEECPE DLPEKFDGNP DMLAPFMAQC QIFMEKSTRD FSVDRVRVCF 120 

VTSMMTGRAA RWASAKLERS HYLMHNYPAF MMEMKHVFED PQRREVAKRK IRRLRQGMGS 1 80 
VIDYSNAFQM IAQDLDWNEP ALIDQYHEGL SDHIQEELSH LEVAKSLSAL IGQCIfflERR 240 
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LARAAAARKP RSPPRALVLP HIASHHQVDP TEPVGG ARMR LTQEEKERRK. KLNLCLYCGT 300 
GGHVADNCPA KASKSSPAGN SPAPL 

Scq ID NO: 107 DNA sequence 
5 Nucleic Acid Accession #: NMJ)03679.1 

Coding sequence: 47-1 507(underlined sequences correspond to start and stop codons) 

1 II 21 31 41 51 

1U GGCACGAGCA GAAGCAACAA TAATTGTGAA AAATACTTCA GCAGTTATGG ACTCATCTGT 60 
CA1TCAAAGG AAA A A AG TAG CTGTCATTGG TGGTGGCTTG GTTGGCTCAT TACAAGCATG 110 

crrrcrrGCA aagaggaatt tccagattg a tgtatatgaa gctagggaag atactcgagt i bo 

GGCTACCTTC ACACGTGGAA GAAGCATTAA CTTAGCCCTT TCTCATAGAG GACGACAAGC 240 
CTTGAAAGCT GTTGGCCTGG AAGATCAGAT TGTATCCCAA GGTATTCCCA TGAGAGCAAG 300 

1 5 AATGATCCAC TCTCTTTCAG GAAAAAAGTC TGCAATTCCC TATGGGACAA AGTCTCAGTA 360 
TATTCTTTCT GTAAGCAGAG AAAATCTAAA CAAGGATCTA TTGACTGCTG CTGAOAAATA 420 
CCCCAATGTG AAAATGCACT TTAACCACAG GCTGTTGAAA TGTAATCCAG AGGAAGGAAT 4S0 
GATCACAGTG CTTGGATCTG ACAAAGTTCC CAAAGATGTC ACTTGTGACC TCATTGTAGG 540 
ATGTGATGGA GCCTATTCAA CTGTCAGATC TCACCTGATG AAGAAACCTC GCTTTGATTA 600 

20 CAGTCAQCAG TACATTCCTC ATGGGTACAT GGAGTTGACT ATTCCACCTA AGAACGGAGA 660 
TTATGCCATG GAACCTAATT ATCTGCATAT TTGGCCTAGA AATACCTTTA TGATGATTGC 720 
ACTTCCTAAC ATGAACAAAT CATTCACATG TACTTTGTTC ATGCCCTTTG AAGAGTTTGA 7S0 
AAAACTTCTA ACCAGTAATG ATGTGGTAGA TTTCTTCCAG AAATACTTTC CGGATGCCAT 840 
CCCTCTAATT GGAGAGAAAC TCCTAGTGCA AGATTTCTTC CTGTTGCCTG CCCAGCCCAT 900 

25 GATATCTGTA AAGTGCTCTT CATTTCACTT TAAATCTCAC TGTGTACTGC TGGGAGATGC 960 
AGCTCATGCT ATAGTGCCGT TTTTTGGGCA AGGAATGAAT GCGGGCTTTG AAGACTGCTT 1020 
GGTATTTGAT GAGTTAATGG ATAAATTCAG TAACGACCTT AGT'iTGTGTC TTCCTGTGTT 1080 
CTCAAGATTG AGAATCCCAG ATG ATCACGC GA'lTfCAGAC CTATCCATGT ACAATTACAT 1 140 
AGAGATGCOA GCACATGTCA ACTCAAGCTG GTTCATTT1T CAGAAGAACA TGGAGAGATT 1200 

3 0 TCTTCATGCG ATTATGCCAT CGACCTTTAT CCCTCTCTAT ACAATGGTCA CTTTTTCCAG 1260 

AATAAGATAC CATGAGGCTG TGCAGCGTTG GCATTGGCAA AAAAAGGTGA TAAACAAAGG 1320 
ACTCTTTTTC TTGGGATCAC TGATAGCCAT CAGCAGTACC TACCTACTTA TACACTACAT 1380 
GTCACCACGA TCTTTCCTCT GCTTGAGAAG ACCATGGAAC TGGATAGCTC ACTTCCGGAA 1440 
TACAACATGT TTCCCCGCA A AGGCCGTGGA CTCCCTAGAA CAAATTTCCA ATCTCATTAG 1 500 

3 5 CAGGTGATAG AA AGGTTTTG TGGTAGCA AA TGCATG ATTT CTCTGTGACC A AAATTAAGC 1 560 
ATG AAA AA AA TGTTTCCATT GCCATATTTG ATTC ACTAGT GG AAGATAGT GTTCTGCTTA 1 620 
TAATTAA ACT GAATGTAG AG TATCTCTGTA TGTTAATTGC AATTACTGGT TGGGGGGTGC 1 6SC 
ATTTTAAAAG ATGAAACATG CACCTTCCCT ACATTACACA CACTCAGGTT GAGTCATTCT 1740 
AACTATAAAA GTOCAATGAC TAAGATCCTT CACTTCTCTG AAAGTAAGGC CCTAGATGCC 1 800 

40 TCAGGG AAGA CAGTAATCAT GCCTTTTCTT TAAAAGACAC AATAGGACTC GCAACAGCAT 1 860 
TGACTCAACA CCTAOGACTA AAAATCACAA CTTAACTAGC ATGTTAACTG CACTTTTCAT 1920 
TACGTGAATG GAACTTACCT AACCACAGGG CTCAGACTTA CTAGATAAAA CCAGAAATGG 1980 
AAATAAGGAA TTCAGGGGAG TTCCAGAGAC TTACAAAATG AACTCATTTT ATTTTCCCAC 2040 
CTTCAA ATAT AAGTATTATC ATCTATCTGT TTATCGTCTA TCTATCTATC ATCTATCTAT 2100 

45 CTATCTATCA TCTATCTATC TATCTATCTA TCTATCTATC TATCTATCTA TCTATCTATC 2 1 60 
TCTATTTATT TATGTA'ITTA GAGATCAGGT L'TCACTCTGT TGACCAGGCT GGAGTGCAGT 2220 
GGTGAGATCT GGGTTCACTG CAACCTCTGC CTCCTGGGCT CAAGCAATCC TCCCACTTCA 2280 
GCCTCCCAAA TAGCTGGGGC TACCATGGTA TTTTTCAGTA GAG ACCGGGT CTTGCC ATGC 2340 
TGCCCAGGCC AGTCTCAAAC TCCTGGCCTC ATGTG ATCTG CCCACCTCAG CCTCCCA AAG 2400 

5 0 TACAGGGATT AGAGTTGTGA GCCACCGCTG CCAGCCCAGA GTTACCCTCT AA AGATAAGA 2460 
AAAAGGCTAT TAATATCATA CTAAGTGAAG GACAGGAAAG GGTTTTATTC ATAAATTAAA 2520 
TGTCTACATG TGCCAGAATG GAAAGGAAAC AAGGGGAGAC AACTTTTATA GAAATACAAA 25S0 
GCCATTACTT TATTCAATTT CAGACCCTCA G AAGCAATTT ACTAATTTAT TCTTCGACTA 2640 
CATACTGCAG CAG AACCAGC AATACACTTG ATTTTTAAAA GCACATTTAG TG AA ATGTTT 2700 

5 5 TCTTTGGTTC ATCCTTCTTT AACAGGCTGC TGAGTCACTC AGAAATCCTT CAAACATGAT 2760 

TAATTATGAA GATGAAACAC TAGAGTCATA TAAGAAATAA AAATTGGGCA ATAAAATAAA 2S20 
ATGATTCAGT GTTTCTTTTC TATATTGTCA ATGAAAACCT TGAGTTCTAA TAATCCATGT 2880 
TCAGTTTOTA GGGAAAGAAA AAATAATTTT TCCTTCTACC CACTTTAGGT TCCTTGGCTG 2940 
GGGCCCCTAT AACAAAAGAC AGATTGACAA GAGAAAAACA AACATAAATT TATTAGCGGG 3000 

60 TATATGTAAT ATATATGTGG GAAATACAGG GGAATGAGCA AATCTCAAAG AGCTGGCGTC 3060 
TTAGAAC TCC CTGGCTTATA TAGCATCGAC A AAGAACAGT AAATTTTTAG AGAAACAACA 3 1 20 
A A A C A A A C r A A AAAGAGCTTT GAGTCTGTAG GGGCAGC :AAT 1TGGGGGAAG CAAA'L'ATATG 3180 
GGAGTTTGOC TTGTAGATTC CTCTGGTGGT GGTCl'CCAGG CTGACAAGGA TTCAAAGTTG 3240 
TCTCTG AAAC TCCTCTTTGT CATACTGCAC ATATA AAACG TCTTTTGTTT CCAACAAGAG 3300 

65 GATTTCTTTT TCATTCTAG A ATTATCTCCT TGATAACTTG ATCAGATATA GGACATGACA 33 60 
CTGAATAGAG TCCAACAGTA CAAAAAAAAT TCAGTATGTT CTAGCTACTT CACACATGTG 3420 
TACGCGACAG TTATTTTTAC AGTAAGGTAT TTTCGAGAAA AATGCATTAC GTGTTTTGGA 3480 
AAATAGAGTA ATTTAAAAAA TATATTTGAA ATGAAAATCT CCAACACATT AGAAGATGAT 3540 
GATGTTAGAT GCCCATCGTG TGCCACAAGT GGTTTTTTCA TTATGTAAAG CACCCGTTGA 3600 

70 ATTAAAAGAA TTTGTTTTTG TTCAACCTCT TCCTGAGGCC CAAGAGCATA TGGGCAATTC 3660 
GGATTTCCTG CTGGACCACA AGGTTCTGTT GATATTACAT AGAAACGGGT ATTCCAGACA 3720 
CTTCTTATGA TGAAAGTCCA AAAGTGGCAT CCAATTTAAG GCCCCATCTT TCGTTGCCAT 3780 
TCTTCATTCC TACAAAGG AC GAACTTGGAT TACATCA ACT TTGGACCCAT TGGTTTTGTC 3840 
GCtGTCGTCA ACTGACAGTG ATTCACCACT GGTGATGATA AAAATGATGG AAGAAGAGTT 3900 

75 GAAAGTCACT TTTTTCTTTG GCCTGTCCCC ATCTTTCTGT GACATCACAA TGGGTCTGAT 3960 
CTGCATTTCA CTTCCAGCTG CTGGTAGGTC TTTAGCAGGC CTCTGGCACC TCAGCAGTCG 4020 
GAGGCACAGA AGCTGCAAAA GGGATCTTCG AAACTGGGCA GAGAAAAAAT AAAGTGGAAT 4080 

396 



WO 02/059377 



PCT/US02/02242 



ATTAAGTAAA AGTTGGGCAC TAATCTGGAT TAACATTCGA GGAAATCAOT TGAGCTGATT 4140 
TAAGTTGTTT TTTGTTTGTT AGCACGTGTG GATGTGGGGT TATGTGGTCA TGCTCAGATC 4200 
TACCTAAATC ACCCCAG AGC TTTATGTCTT TTATTCATTC TAA ATCTTAT TAACCGGAAT 4260 
ATGTAGOACC ATTTCA AT AC CTTGTAATCC TCCAAGCTTC AATCTGCACA CACTTTCTAT 4320 
5 GAGGGCAGGT ACAACTATTA AGAGATTTTG AACATTAAGT TAGTCCACAA ATATTCAGTG 4380 
GGCATCTACT AGGTGACAGC CACTGTGCTA TAATTAGAGA CTTTTTACTA TAAGCATCAA 4440 
AAACAGATAA GGCTCTTCCT GGCAGAGTTT ACAGCCTGGT GTACTTGCTA ATGTCTCTTT 4500 
AATTAGGTGA AGA ATTTTTT TTTTCTATCG AAATTACTAA TCAGTTGGGG AAAAAAATAC 4560 
TATAGCAGAC AGCACTAATG TCATCAACAA ACATTGTTCT TCTCCGTGTC CTGGGTACAA 4620 

10 CATCGAATAA TATTTCTTGG CCTCCTTTCC GCTTCTCCTC TCTGCTGTTC CTCTCTACAA 4680 

GAACCTGGGA GGCCAACGCC TAA AGATCAT AATATCACAA TGGAAGGAAC CTAGATTCCT 4740 
AAATGACTGC AT AG G AC AG A TCCCATCTCC TCCACCCAAT ACATTATTACi AGTGAACTGT 4800 
GACCTGAAAT GAGCAATAAA CTCTGTATTA ATTCACTGAA ATGTTGGGGT TGCTTGTTAT 4S60 
AGTAGTCGGT CCATCATGAC CAGTAAAACA TAAATCAAAA GTTAATGTAA TTGTTATCCC 4920 

1 5 ATTATTTAG A GCGAA ATAAA TGTTG AATAT ATGGACTTTC TCAGATTAGG AAATACCAAT 4980 
TAAAAATATA ATAAATAGCT 

Seq ID NO: 1 OS Protein sequence: 
Protein Accession ft NP 003670. 1 

20 

I 1! 21 31 41 51 

MDSSVIQRKK VAVIGGGLVG SLQACFLAKR NFQIDVYEAR EDTRVATFTR GRSINLALSH 60 
RGRQALKAVG LEDQIVSQGI PMRARMIHSL SGKKSAIPYG TKSQYILSVS RENLNKDLLT 1 20 

25 AAEKYPNVKM HFNHRLLKCN PEEGMITVLG SDKVPKDVTC DLIVGCDGAY STVRSHLMKK ISO 
PRFDYSQQYI PHGYMELTIP PKNGDYAMEP NYLHIWPRNT FMMIAU'NMN KSF TCTLFMP 240 
FEEFBKLLTS NDVV DFFQKY R'D AIPL1GE KIXVQDFF1X PAQPMISVKC SSFHFKSHCV 300 
LLGDAAHAIV PFFGQGMNAG FBUCLVFDEL MDKFSNDLS1, Cl.PVFSKLRI PDDHAISDLS 360 
MYNYHSMRAH VNSSWPIPQX NMERFLHAIM PSTFJPI.YTM VTPSRIRYHE AVQRWHWQKK 420 

30 VINKGLFFLG SUAISSTYL L1HYMSPRSF LCLRRPWNWI AHFRNTTCFP AKAVDSLEQI 480 
SNLISR 

Seq ID NO: 109 DNA sequence 
Nucleic Acid Accession if: NM_0O61 15.1 

35 Coding sequence: 236-1 765 (underlined sequences correspond to start and slop cc-dons) 

1 11 21 31 41 51 

GCTTCAGGGT ACAGCTCCCC CGCAGCCAGA AGCCGGGCCT GCAGCCCCTC AGCACCGCTC 60 

40 CGGGACACCC CACCCGCTTC CCAGGCGTGA CCTGTCAACA GCAACTTCGC GGTGTGGTGA 120 
ACTCTCTGAG GAAAA ACCAT TTTGATTATT ACTCTCAGAC GTGCGTGGCA ACAAGTGACT 1 SO 
GAGACCTAGA AATCCAAGCG TTGGAGGTCC TGAGGCCAGC CTAAGTCGCT TCAAAATGGA 240 
ACGAAGGCGT TTGTGGGGTT CCATTCAGAG CCGATACATC AGCATGAGTG TGTGGACAAG 300 
CCCACGGAGA CTTGTGGAGC TGGCAGGGCA GAGCCTGCTG AAGGATGAGG CCCTGGCCAT 360 

45 fGCCGCCCK GACn-reCTGC CCAGGGAGC1 CTTCCCGCXIA ( ICTTCATGG CAGC< TTTGA 12'' 
CGGGAGACAC AGCCAGACCC TGAAGGCAAT GGTGCAGGCC TGGCCCTTCA CCTGCCTCCC 480 
TCTGGGAGTG CTGATGAAGG GAC AACATCT TCACCTGG AG ACCTTCAAAG CTGTGCTTG A 540 
TGGACTTGAT GTGCTCCTTG CCCAGGAGGT TCGCCCCAGG AGGTGGAAAC TTCAAGTGCT 600 
GGATTTACGG AAGAACTCTC ATCAGGACTT CTGGACTGTA TGGTCTGGAA ACAGGGCCAG 660 

50 TCTGTACTCA TTTCCAGAGC CAGAAGCAGC TCAGCCCATG ACAAAGAAGC GAAAAGTAGA 720 
TGGTTTGAGC ACAGAGGCAG AGCAGCCCTT CATTCCAGTA GAGGTGCTCG TAGACCTGTT 780 
CCTCAAGGAA GGTGCCTGTG ATGAATTGTT CTCCTACCTC ATTGAGAAAG TGAAGCGAAA 840 
GAAAAATGTA CTACGCCTGT GCTGTAAGAA GCTGAAGATT TTTGCAATGC CCATGCAGGA SCO 
TATCAAGATG ATCCTGAAAA TGGTGCAGCT GGACTCTATT GAAGATTTGG AAGTGACTTG 960 

55 TACCTG G A AG CTACCCACCT TGGCGAAATT TTCTCCTTAC CTGGGCCAG A TG ATTAATCT 1020 
GCGTAGACTC CTCCTCTCCC ACATCCATGC ATCTTCCTAC ATTTCCCCGG AGAAGGAAGA 1080 
GCAGTATATC GCCCAGTTCA CCTCTCAGTT CCTCAGTCTG CAGTGCCTGC AGGCTCTCTA 1 1 40 
TGTGGACTCT TTATTTTTCC TTAGAGGCCG CCTGGATCAG TTGCTCAGGC ACGTGATGAA 1200 
CCCCTTGGAA ACCCTCTCAA TAACTAACTG CCGGCTTTCG GAAGGGGATG TGATGCATCT 1260 

6 0 GTCCCAG AGT CCCAGCGTCA GTCAGCTAAG TGTCCTGAGT CTAAGTGGGG TCATGCTGAC 1 320 
CGATGTAAGT CCCGAGCCCC TCCAAGCTCT GCTGGAGAGA GCCTCTGCCA CCCTCCAGGA 1380 
CCTGGTCTTT GATGAGTGTG GGATCACGGA TLj \TC ]'C CTTG CI C T* I r TTCCCT 1440 
GAGCCACTGC TCCCAGCTTA CAACCTTAAG CTTCTACGGG AATTCCATCTCCATATCTGC 1500 
CTTGCAGAGT CTCCTGCAGC ACCTCATCGG GCTGAGCA AT CTG ACCCACG TGCTGTATCC 1560 

65 TGTCCCCCTG GAGAGTTATG AGGACATCCA TGGTACCCTC CACCTGGAGA GGCTTGCCTA 1 620 
TCTGCATGCC AGGCTCAGGG AGTTGCTGTG TGAGTTGGGG CGGCCCAGCA TGGTCTGGCT 1680 
TAGTGCCAAC CCCTGTCCTC ACTGTGGGGA CAGAACCTTC TATGACCCGG AGCCCATCCT 1740 
GTGCCCCTGT TTCATGCCTA AC TAG CTGGG TGCACATATC A AATGCTTCA TTCTGCATAC 1 800 
TTGGACACTA AAGCCAGGAT GTGCATGCAT CTTGAAGCAA CAAAGCAGCC ACAGTTTCAG 1860 

70 ACAAATGTTC AGTGTGAGTG AGGAAAACAT GTTCAGTGAG GAAAAAACAT TCAGACAAAT 1920 
GTTCAGTGAG GAAAAAAAGG GGAAGTTGGG GATAGGCAGA TGTTGACTTG AGGAGTTAAT L9S0 
GTGATCTTTG GGGAGATACA TCTTATAGAG TTAGAAATAG AATCTGAATT TCTAAAGGGA 2040 
GATTCTGGCT TGGGAAGTAC ATGTAGGAGT TAATCCCTGT GTAGACTGTT GTAAAG AAAC 2100 
TGTTGAAAAT AAAGAGAAGC AATGTGAAGC AAAAAAAAAA AAAAAAAA 

75 

Seq ID NO: 1 10 Protein sequence: 
Protein Accession ft NFJ>061 06. ! 
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I II 21 31 41 51 
I I I I I 1 

MERRRLWGS1 QSRY1SMSVW TSPRELVELA GQSLLKDEAL AIAALELLPR ELFPPLFMAA 60 
FDGRHSQTLK AMVQAWPFTC LPLGVLMKGQ HLHLETFKAV LDGLDVLLAQ EVRPRRWKLQ 1 20 
VLDLRKNSHQ DFWTVWSGNR ASLYSFPEPE AAQFMTKKRK VDGLSTEAEQ PFIPVEVLVD 1 80 
LFLKEGACDE LFSYLIEKVK FJCKNVLRLCC KXLKIFAMPM QDKCMLKMV QLDSIEDLEV 240 
TCTWKLPTLA KFSPYLGQMI NLRRLLLSHI HASSYISPEK EEQYIAQFTS QFLSLQCLQA 300 
LYVDSLFFLR GRLDQLLRHV MNPLETLSIT NCRLSEODVM HLSQSPSVSQ LSVLSLSGVM 360 
LTDVSPEPLQ ALLERASATL QDLVFDECGI TDDQLLALLP SLSHCSQLTT LSFYGNS1SI 420 
SALQSLLQHL IG1.SNLTHVL YPVPLESYED 1HGTLHLERL AYLHARLREL LCELGRPSMV 480 
WLSANPCl'HC GDRTFYDPEP ILCPCFMPN 



Seq ID NO: 111 DNA sequence 

Nucleic Acid Accession ft NM.003SI5 

Coding sequence: 8-2452 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I I 

CGCTGCCATG CGGCTGGCGC TGCTCTGGGC CCTGGGGCTC CTGGGCGCGG GCAGCCCTCT 60 
GCCTTCCTGG CCGCTCCCAA ATATAGGTGG CACTGAGGAG CAGCAGGCAG AGTCAGAGAA 120 
GGCCCCGAGG GAGCCCTTGG AGCCCCAGGT CCTTCAGGAC GATCTCCCAA TTAGCCTCAA 180 
AAAGGTGCTT CAGACCAGTC TGCCTGAGCC CCTGAGGATC AAGTTGGAGC TGGACGGTGA 240 
CAGTCATATC CTGGAGCTGC TACAGAATAG GGAGTTGGTC CCAGGCCGCC CAACCCTGGT 300 
GTGGTACCAG CCCGATGGCA CTCGGGTGGT CAGTGAGGGA CACACTTTGG AGAACTGCTG 360 
CTACCAGGGA AGAGTGCGGG GATATGCAGG CTCCTGGGTG TCCATCTGCA CCTGCTCTGG 420 
GCTCAGAGGC TTGGTGGTCC TGACCCCAG A GAOAAOCTAT ACCCTGGAGC AGGGGCCTGG 480 
GGACCTTCAG GGTCCTCCCA TTATTI'CGCG AATCCAAGA1 ( l 1 Vt iC rjAGGC-'CACAC 540 
CTGTGCGCTG AGCTGGCGGG AATCTGTACA CACTCAGACG CCACCAGAGC ACCCCCTGGG 600 
ACAGCGGCAC ATTCGCCGOA GGCGGGATGT GGTAACAQAG ACCAAGACTG TGGAGTTGGT 660 
GATTGTGGCT GATCACTCGG AGGCCCAGAA ATACCGGGAC TTCCAGCACC TGCTAAACCG 720 
CACACTGGAA GTGGCCCTCT TGCTGG ACAC ATTCTTCCGG CCCCTG AATG TACG AGTGGC 780 
ACTAGTGGGC CTGGAGGCCT GGACCCAGCG TGACCTGGTG GAGATCAGCC CAAACCCAGC 840 
TGTCACCCTC GAAAACTTCC TCCACTGGCG CAGGG CACAT TTGCTGCCTC GATTGCCCCA 900 
TGACAGTGCC CAGCTGGTGA CTGGTACTTC ATTCTCTGGG CCTACGGTGG GCATGGCCAT 960 
TCAGAACTCC ATCTGTTCTC CTGACTTCTC AGGAGGTGTG AACATGGACC ACTCCACCAG 1020 
CATCCTGGGA GTCGCCTCCT CCATAGCCCA TGAGTTGGGC CACAGCCTGG GCCTGGACCA 10SD 
TGATTTGCCT GGGAATAGCT GCCCCTGTCC AGGTCCAGCC CCAGCCAAGA CCTGCATCAT 1 140 
GGAGGCCrCC ACAGACTTCC TACCAGGCCT GA ACTTCAGC AACTGCAGCC GACGGGCCCT 1 200 
GGAGAAAGCC CTCCTGGATG GAATGGGCAG CTGCCTCTTC GAACGGCTGC CTAGCCTACC 1260 
CCCTATGGCT GCTTTCTGCG GAAATATGTT TGTGGAGCCG GGCGAGCAGT GTGACTGTGG i 320 
CTTCCTGGAT GACTGCGTCG ATCCCTGCTG TGATTCTTTG ACCTGCCAGC TGAGGCCAGG 13S0 
TGCACAGTGT GCATCTGACG GACCCTGTTG TCAAAATTGC CAGCTGCGCC CGTCTGGCTG 1440 
( iC AG1 GTCGT CCTACCAGAG GGGATTGTGA CTTGCCTGAA TTCTGCCCAG GAGACAGCTC 1 500 
CCAOTGTCCC CCTGATGTCA GCCTAGGGGA TGGCGAGCCC TGCGCTGGCG GGCAAGCTGT 1 560 
GTGCATGCAC GGGCGTTGTG CCTCCTATGC CCAGCAGTGC CAGTCACTTT GGGGACCTGG 1620 
AGCCCAGCCC GCTGCGCCAC TFTGCCTCCA GACAGCTAAT ACTCGGGGAA ATGCTTTTGG 16S0 
GAGCTGTGGG CGCAACCCCA GTGGCAGTTA TGTGTCCTGC ACCCCTAGAG ATGCCATTTG 1 740 
TGGGCAGCTC CAGTGCCAGA CAGGTAGGAC CCAGCCTCTG CTGGGCTCCA TCCGGGATCT 1800 
ACTCTGGGAG ACAATAGATG TGAATGGGAC TGAGCTGAAC TGCAGCTGGQ TGCACCTGGA 1860 
CCTGGGCAGT GATGTGGCCC AGCCGCTCCT GACTCTGCCT GGCACAGCCT GTGGCCCTGG 1920 
CCTGGTGTGT ATAGACCATC GATGCCAGCQ TGTGGATCTC CTGGGGGCAC AGGAATGTCG 1980 
AAGCAAATGC CATGGACATG GGGTCTGTGA CAGCAACAGG CACTGCTACT GTGAGGAGGG 2040 
CTGGGCACCC CCTGACTGCA CCACTCAGCT CAAAGCAACC AGCTCCCTGA CCACAGGGCT 2100 
GCTCCrCAGC CTCCTGGTCT TATTGGTCCT GGTGATGCTT GGTGCCGGCT ACTGGTACCG 2 1 60 
TGCCCGCCTG CACCAGCGAC TCTGCCAGCT CAAGGGACCC ACCTGCCAGT ACAGGGCAGC 2220 
CCAATCTGGT CCCTCTGAAC GGCCAGGACC TCCGCAGAGG GCCCTGCTGG CACGAGGCAC 2280 
TAAGTCTCAG GGGCCAGCCA AGCCCCCACC CCCAAGG AAG CCACTGCCTG CCGACCCCCA 2340 
GGGCCGGTGC CCATCGGGTG ACCTGCCCGG CCCAGGGGCT GGAATCCCGC CCCTAGTGGT 2400 
AOCCTCCAGA CCAGCGCCAC CGOCTCCGAC AGTGTCCTCG CTC rACCTCTG A.CCTCTCCG 2460 
GAGGTTCCGC TGCCTCCAAG CCGGACTTAG GGCTTCAAGA GGCGGUCGTG CCCTCTGGAG 2520 
TCCCCTACCA TGACTGAAGG CGCCAGAGAC TGGCGGTGTC TTAAGACTCC GGGCACCGCC 25B0 
ACGCGCTGTC AAGCAACACT CTGCGGACCT GCCGGCGTAG TTGCAGCGGG GGCTTGGGGA 2640 
GGGGCTGGGG GTTGGACGGG ATTGAGGAAG GTCCGCACAG CCTGTCTCTG CTCAGTTGCA 2700 
ATAAACGTGA CATCTTGGGA GCGTTAAAAA AAAAAAAAAA 

Seq ID NO: 1 12 Protein sequence; 
Protein Accession ft NP_003S06.2 

1 11 21 31 41 51 

MRLALLWALG LLGAGSPLPS WPLPNIGGTE EQCJAESEKAP REPLEPQVLQ DDLPISLKKV 60 
LQTSLPEPLR IKLELDGDSH ILELLQNREL VPGRPTLVWY QPDGTRWSE GHTLENCCYQ 120 
GRVRGYAGSW VSICTCSGLR GLVVLTPERS YTLEQGPGDL QGPP1ISPJQ DLHLPGHTCA 180 
LSWRESVHTQ TPPEHPLGQR HIRRKRDVVT BTK.TVELV1V ADHSEAQKYR DFQHLLNRTL 240 
EVALLLDTFF RPLNVRVALV OLEAWTQRDL VEISPNPAVT LENFLHWRRA HLLPRLPHDS 300 
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AQLVTGTSFS GPTVGMAIQN SICSPDFSGG VNMDHSTS1L GVASSIAHEL GHSLGLDHDL 360 
PGNSCPCPGP APAKTCIMEA STDFLPGLNF SNCSRRALEK ALLDGMGSCL FERLPSLPPM 420 
AAFCGNMFVE PGEQCDCGFL DDCVDPCCDS LTCQLRPG AQ CASDGPCCQN CQLRPSGWQC 4S0 
RPTRGDCDLP EFCFGDSSQC PPDVSLGDGE FCAGGQAVCM HGRCASYAQQ CQSLWGFGAQ 540 
PAAPLCLQTA NTRGNAFGSC GRNPSGSYVS CTPRDAICGQ LQCQTGRTQP LLGSIRD1XW 600 
ETID VNGTEL NCSWVHLDLG SD VAQPLLTL PGTACGPGLV C1DHRCQRVD LLGAQECRSK 660 
CHGHGVCDSN RHCYCEEGWA PPDCTTQLKA TSSLTTGLLL SLLVLLVLVM LGAGYWYRAR 720 
LHQRLCQLKG PTCQYRAAQS GPSERPGPPQ RALLARGTKS QGPAKPPPPR KPLPADFQGR 780 
CPSGDLPGPG AGIPPLVVPS RPAPFPPTVS SLYL 

Sen; ID NO: 1 13 DNA sequence 

Nucleic Acid Accession*: NMJM2416 

Coding sequence: 40-417 (underlined sequences correspond to start and stop codons) 
1 II 21 31 41 SI 

ATCCAATACA GGAGTGACTT GGAACTCCAT TCTATCACTATGAAG AAA AG TGGTGTTCTT 60 
TTCCTCTTGG GCATCATCTT GCTGGTTCTG ATTGG AGTGC AAGGAACCCC AGTAGTG AGA 1 20 
AAGGGTCGCT GTTCCTGCAT CAGCACCAAC CAAGGGACTA TCCACCTACA ATCCTTGAAA 1 B0 
GACCTTAAAC AATTTGCCCC AAGCCCTTCC TGCGAGAAAA TTGAAATCAT TGCTACACTG 240 
AAGAATGGAG TTCAAACATG TCf AAACCCA GATTCAGCAG ATGTGAAGGA ACTGATTAAA 300 
AAGTGGGAGA AACAGGTCAG CCAAAAGAAA AAGCAAAAGA ATGGGAAAAA ACATCAAAAA 360 
AAGAAAGTTC TGAAAGTTCG AAAATCTCAA CGTTCTCGTC AAAAGAAGAC TACATAAGAG 420 
ACCACTTCAC CAATAAGTAT TCTGTGTTAA AAATGTTCTA TTTTAATTAT ACCGCTATCA 480 
TTCCAAAGGA GGATGGCATA TAATACAAAG GCTTATTAAT TTGACTAGAA AATTTAAAAC S40 
ATTACTCTGA AATTGTAACT AAAGTTAGAA AGTTGATTTT AAGAATCCAA ACGTTAAGAA 600 
TTOTTAAAGG CTATGATTGT CTTTGTTCTT CTACCACCCA CCAGTTGAAT '7TCATCATGC 660 
'I-fAAGGCCAT GATTTTAOCA ATaCCCATGT CTACACAG AT GTTCACCCAA CCACA'l CCCA 720 
CTCA ACAGC 3CCTG * OA CAGCCC1 GOCTTCC "'1 CKSCAG C ICAGAGAG 780 
TATCTGAGGC ACATGTCAGC AAGTCCTAAG CCTGTTAGC A TGCTGGTGAG CC AAGCAGTT 840 
TGAAATTGAG CTGGACCTCA CCAAGCTG CT GTGGCCATCA ACCTCTGTAT TTGA ATCAGC 900 
CTACAGGCCT CACACACAAT GTGTCTG AGA GATTCATGCT GATTGTTATT GGGTATCACC 960 
ACTGGAGATC ACCAGTGTGT GGCTTTCAGA GCCTCCTTTC TGGCTTTGGA AGCCATGTGA 1020 
TTCCATCTTG CCCGCTCAGG CTGACCACTT TATTTCTTTT TGTTCCCCTT TGCTTCATTC 1080 
AAGTCAGCTC TTCTCCATCC TACCACA ATG CAGTGCCTTT CTTCTCTCCA GTGCACCTGT U40 
CATATGCTCT GATTT ATCTG AGTCAACTCC TTTCTCATCT TGTCCCCAAC ACCCCACAGA 1200 
AGTGCTTTCT TCTCCCAATT CATCCTCACT CAGTCCAGCT TAGTTC AAGT CCTGCCTCTT 1260 
AAATAAACCT TTTTGGACAC ACAAATTATC TTAAAACTCC TGTTTCACTT GGTTCAGTAC 1320 
CACATGGGTG AACACTCAAT GGTTAACTAA TTCTTGGGTG TTTATCCTAT CTCTCCAACC 1380 
AGATTGTCAG CTCCTTGAGG GCAAGAGCCA CAGTATATTT CCCTGTTTCT TCCACAGTGC 1440 
CTAATAATAC TGTGGAACTA OGTTTTAATA ATTTTTTAAT TGATGTTGTT ATGGGCAGGA 1500 
TGGCAACCAG ACCATTGTCT CAGAGCAGGT GCTGGCTCTT TCCTGGCTAC TCCATGTTGG 1 560 
CTAGCCTCTG GTAACCTCTT ACTTATTATC TTCAGGACAC TCACTACAGG GACCAGGGAT 1620 
GATGCA ACAT CCTTGTCTTT TTATGACAGG ATGTTTGCTC AWHTCTCCA A C A ATA AG A A • 680 
GCACGTGGTA AAACACTTGC GGATATTCTG GACTGTTTTT AAAAAATATA CAGTTTACCG 1 740 
AAAATCATAT AATCTTACAA TGAAAAGGAC TTTATAGATC AGCCAGTGAC CAACCTTTTC 1800 
CCAACCATAC AAAAATTCCT TTTCCCG A AG G A A A AGGGCT TTCTCAATAA GCCTCAOCTT 1 860 
TCTAAGATCT AACAAGATAG CCACCGAGAT CCTTATCGAA ACTCATTTTA GGCAAATATG 1920 
AGTTTTATTG TCCGTTTACT TGTTTCAGAG TTTGTATTGT GATTATCAAT TACCACACCA 1980 
TCTCCCATGA AGAAAGGGAA CGGTGAAGTA CTAAGCGCTA GAGGAAGCAG CCAAGTCGGT 2040 
TAGTGGAAGC ATGATTGGTG CCCAGTTAGC CTCTGCAGGA TGTGGAAACC TCCTTCCAGG 2100 
GGAGGTTCAG TGAATTGTGT AGGAGAGGTT GTCTGTGGCC AGAATTTAAA CCTATACTCA 21 60 
CTTTCCCAAA TTGAATCACT GCTCACACTG CTGATGATTT AGAGTGCTGT CCGGTGGAGA 2220 
TCCCACCCGA ACGTCTTATC TAATCATGAA ACTCCCTAGT TCCTTCATGT AACTTCCCTG 2280 
AAAAATCTAA GTGTTTCATA AATTTGAGAG TCTGTGACCC ACTTACCTTG CATCTCACAG 2340 
GTAGACAGTA TATAACTAAC AACCAAAGAC TACATATTGT CACTGACACA CACGTTATAA 2400 
TCATTTATCA TATATATACA TACATGCATA CACTCTCAAA GCA AATA ATT TTTCACTTCA 2460 
AAACAGTATT GAGTTGTATA CCTTGTAATT TGAAATATTT TCTTTGTTAA AATAGAATGG 2520 
TATCAATAAA TAGACCATTA ATCAG 

Seq ID NO: 1 14 Protein sequence: 
Protein Accession II: NPJ102407 

1 11 21 35 41 51 

MKKSGVLfLl GI1LLVL1GV QGTPWRKGR CSCISTNQGT IHLQSLKDLK QFAPSPSCEK 60 
IEIIATLKNG VQTCLMPDS A DVKELKKWE K.QVSQKKKQK NGKKHQKKKV LKVRKSQRSR 120 . 
QKKTT 

Seq ID NO: 115 DNA sequence 

Nucleic Acid Accession/*: NM_003238.1 

Coding sequence: 182-1426 (underlined sequences correspond to start and slop codons) 
1 11 21 31 41 51 

CAAGCAGGAT ACGTTTTTCT GTTGGGCATT GACTAGATTG TTTGCAAA AG TTTCGCATCA 60 
AAAACAAACA ACAACAACAA AAAACCAAAC AACTCTCCTT GATCTATACT TTGAGAATTG 120 
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TTOATTTCTT TTTTTTTATT CTGACTTTTA AA A ACAACTT TTTTTTCCAC TTTTTTAAAA 1 SO 
AATGCACTAC TGTGTGCTGA GCGCTTTTCT GATCCTOCAT CTGOTCACGG TCGCGCTCAG 240 
CCTGTCTACC TGCAGCACAC TCG ATATGGA CCAGTTCATG CGCA AGAGGA TCG AGGCGAT 300 
CCGCGGOCAG ATCCTGAGCA AGCTGAAGCT CACCAGTCCC CCAGAAGACT ATCCTGAGCC 360 
CGA6GAAGTC CCCCCGGAGG TGATTTCCAT CTACAACAGC ACCAGGGACT TGCTCCAGGA 420 
GAAGGCGAGC CGGAGGGCGG CCGCCTGCG A GCGCGAGAGG AGCGACGAAG AGTACTACGC 480 
CA AGGAGGTT TACAAAATAG ACATGCCGCC CTTCTTCCCC TCCGAAAATG CCATCCCGCC 540 
CACTTTCTAC AGACCCTACT TCAGAATTGT TCGATTTGAC GTCTCAGCAA TGGAGAAGAA 600 
TGCTTCCAA7 TTGGTGAAAG CAGAGTTCAG AGTCTTTCGT TTGCAGAACC CAAAAGCCAG 660 
AGTGCCTGAA CAACGGATTG AGCTATATCA GATTCTCAAG TCCAAAGATT TAACATCTCC 720 
AACCCAGCGC TACATCGACA GCAAAOTTOT GAAAACAAGA GCAGAAGGCG AATGGCTCTC 780 
CTTCGATGTA ACTGATGCTG TTCATGAATG GCITCACCAT AAAGACAGGA ACCTGGfiATT 840 
TA AAATAAGC TTACACTGTC CCTGCTGCAC TTTTGTACCA TCTAATAATT ACATCATCCC 900 
AAATAAAAGT GAAGAACTAG AAGCAAGATf TGCAGGTATT GATGGCACCT CCACATATAC 960 
CAGTGGTGAT CAGAAAACTA TAAAGTCCAC TAGGAAAAAA AACAGTGGGA AGACCCCACA 1020 
TCTCCTGCTA ATGTTATTGC CCTCCTACAG ACTTG AGTCA CAACAG ACCA ACCGGCGG AA 1 080 
GAAGCGTGCT TTGGATGCGG CCTATTGCTT TAGAAATGTG CAGGATAATT GCTGCCTACG 1 140 
TCCACTTTAC ATTGATTTCA AGAGGGATCT AGGGTGGAAA TGGATACACG AACCCAAAGG 1200 
GTACAATGCC AACTTCTGTG CTGGAGCATG CCCGTATTTA TGGAGTTCAG ACACTCAGCA 1260 
C AGCAGGGTC CTGAGCTTAT ATAATACCAT AAATCCAG AA GCATCTGCTT CTCCTTGCTG 1 320 
GGTGTCCCAA G ATTTAG AAC CTCTAACCAT TCTCTACTAC ATTGGCAAAA CACCCAAGAT 1 380 
TGAACAGCTT TCTAATATGA TTGTAA AGTC TTGCAA ATGC AGCTAAAATT CTTGGA A AAG 1 440 
TGGCAAGACC AAAATGACAA TGATGATGAT AATGATGATG ACGACGACAA CGATGATGCT 1500 
TGTAACAAGA AAACATAAG A G AGCCTTGGT TCATCAGTGT TAAAAA ATTT TTGAAAAGGC 1 560 
GGTACTAGTT CAG ACACTTT GGAAGTTTGT GTTCTGTTTG TTAAAACTGG CATCTG ACAC 1 620 
AAAAAAAGTT GAAGGCCTTA TTCTACATTT CACCTACTTT GTAAGTGAGA GAGACAAGAA 1680 
GCAAATTTTT TTAAA 



1 11 21 31 41 51 
) ) 1 I I 1 

MHYCVLSAFL ILHLVTVALS LSTCSTLDMD QFMRKRIEAI RGQILSK.LKL TSPPBDYPEP 60 
EEVPPEVIS1 YNSTRDLLQE KASRRAAACE RERSDEEYYA KEVYKIDMPP FFPSENAIPP 120 
TFYRPYFRIV RFDVSAMEKN ASNLVKAEFR VFRLQNPKAR VPEQRIELYQ ILKSKDLTSF 1 SO 
TQRY1DSKVV KTRAEGEWLS FDVTDAVHEW LHHKDRNLGF KISLHCPCCT FVPSNNYIIP 240 
NKSEELEARF AGIDGTSTYT SGDQKTKST RKKN SGKTPH LLLMLLPSYR LESQQTNRRK 300 
KRALDAAYCF RNVQDNCCLR FLYIDFKRDL GWKW1HEPKG YNANFCAGAC PYLWSSDTQH 360 
SRVLSLYNTI NPEASASPCC VSQDLEPLTI LYYIOKTPKI EQLSNMIVKS CKCS 

Scq ID NO: 1 i7 DNA sequence 

Nucleic Acid Accession #: NM_000095. 1 

Coding sequence: 26-2299 (underlined sequences correspond to start and stop cortons) 
1 11 21 31 41 51 

CAGC ACCCAG CTCCCCGCCA CCGCCATGGT CCCCG ACACC GCCTGCGTTC TTCTGCTCAC 60 
CCTGGCTGCC CTCGGCGCGT CCGGACAGGG CCAGAGCCCG TTGGGCTCAG ACCTGGGCCC 120 
GCAGATGCTT CGGGAACTGC AGG AAACCAA CGCGGCGCTG CAGGACGTGC GGGACTGGCT 1 80 
GCGGCAGCAG GTCAGGGAGA TCACGTTCCT GAAAAACACG GTGATGGAGT GTGACGCGTG 240 
CGGGATGCAG CAGTCAGTAC GCACCGGCCf ACCCAGCGTG CGGCCCCTGC TCCACTGCGC 300 
OCCCGGCTTC TGCTTCCCCG GCGTGGCCTG CATCCAGACG GAGAGCGGCG GCCGCTGCGG 360 
CCCCTGCCCC GCGGGCTTCA CGGGCAACGG CTCGCACTGC ACCGACGTCA ACGAGTGCAA 420 
CGCCCACCCC TGCTTCCCCC GAGTCCGCTG TATCAACACC AGCCCGGGGT TCCGCTGCGA 480 
GGCTTGCCCG CCGGGGTACA QCGGCCCCAC CCACCAGGGC GTGGGGCTGG CTTTCGCCAA 540 
GGCCAACAAG CAGGTTTGCA CGGACATCAA CGAGTGTGAG ACCGGGCAAC ATAACTGCGT 600 
CCCCAACTCC GTGTijCATCA ACACCCGGGG CTCCTTCCAG TGCGGCCCGT GCCAGCCCGG 660 
CTTCGTGGGC GACCAGGCGT CCGGCTGCCA GCGCGGCGCA CAGCGCTTCT GCCCCGACGG 720 
CTCGCCCAGC GAGTGCCACG AGCATGCAG A CTGCGTCCTA GAGCGCGATG GCTCGCGGTC 780 
TGCGTGTGT CGCGTTGGCT GGGCCGGCAA C< iAl T< ft 1 jli GO CAC1 j i M> 
OCTTC< 3GACG 3CTGC i. "LCGGAGCCG CAGTGCCGTA AGGACAACTG 900 

CGTGACTGTG CCCAACTCAG GGCAGGAGGA TGTGGACCGC GATGGCATCG GAGACGCCTG 960 
CGATCCGGAT GCCGACGGGG ACGGGGTCCC CAATGAAAAG GACAACTGCC CGCTGGTGCG 1020 
GAACCCAGAC CAGCGCAACA CGGACGAGGA CAAGTGGGGC GATGCGTGCG ACAACTGCCG 1080 
GTCCCAGAAG AACGACGACC AAA AGG ACAC AGACCAGGAC GGCCGGGGCG ATGCGTGCGA 1140 
CGACG ACATC G ACGGCGACC GG ATCCGCAA CCAGGCCGAC A ACTCCCCTA GGGTACCCAA 1200 
CTCAGACCAG AAGGACAGTG ATGGCGATGG TATAGGGGAT GCCTGTGACA ACTGTCCCCA 1 260 
GAAGAGCAAC CCGGATCAGG CGGATGTGGA CCACGACTTT GTGGGAGATG CTTGTGACAG 1320 
CGATCAAGAC CAGGATGGAG ACGGACATCA GGACTCTCGG GACAACTGTC CCACGGTGCC 1380 
TAACAGTGCC CAGGAGGACT CAGACCACGA TGGCCAGGGT GATGCGTGCG ACGACGACGA 1440 
CGACAATGAC GGAGTCCCTG ACAGTCGGGA CAACTGCCGC CTGGTGCCTA ACCCCGGCCA 1500 
GGAGGACGCG GACAGGGACG GCGTGGGCGA CGTGTGCCAG GACGACTTTG ATGCAGACAA 1560 
GGTGGTAG AC AAG ATCG ACQ TGTGTCCGG A G AACGCTGAA GTCACGCTCA CCGACTTCAG i 620 
GGCCTTCCAG ACAGTCGTGC TGGACCCGGA GGGTGACGCG CAGATTGACC CCAACTGGGT 1680 
GGTGCTCAAC CAGGGAAGGG AGATCGTGCA GACAATGAAC AGCGACCCAG GCCTGGCTGT 1740 
GGGTTACACT GCCTTCAATG GCGTGGACTT CGAGGGCACG TTCCATGTGA ACACGGTCAC 1800 
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GGATGACGAC TATGCGGGCT TCATCTTTGG CTACCAGGAC AGCTCCAGCT TCTACGTGGT 1860 
CATGTGGAAG CAGATGGAGC AAACGTATTG GCAGGCGAAC CCCTTCCGTG CTGTGGCCGA 1920 
GCCTGGCATC CAACTCAAGG CTGTG AAGTC TTCCACAGGC CCCGGGGAAC AGCTGCGGAA 1980 
CGCTCTGTGG CATACAGGAG ACACAGAGTC CCAGGTGCGG CTGCTGTGGA AGGACCCCCG 2040 
5 AAACGTGGGT TGGAAGGACA AGAAGTCCTA TCGTTGGTTC CTGCAGCACC GGCCCCAAGT 2 ! 00 
GGGCTACATC AGGGTGCGAT TCTATGAGGG CCCTGAGCTG GTGGCCGACA GCAACGTGGT 2160 
CTTGGACACA ACCATGCGGG GTGGCCGCCT GGGGGTCTTC TGCTTCTCCC AGGAGAACAT 2220 
CATCTGGGCC AACCTGCGTT ACCGCTGCAA TGACACCATC CCAGAGGACT ATGAGACCCA 2280 
TCAGCTGCGG CAAGCCTAGG GACCAGGGTG AGGACCCGCC GGATGACAGC CACCCTCACC 2340 
1 0 GCGGCTGGAT GGGGGCTCTG CACCCAGCCC AAGGGGTGGC CGTCCTGAGG GGGAAGTGAG 2400 
AAGGGCTCAG AGAGGACAAA ATAAAGTGTG TGTGCAGGG 

Seq ID NO: 118 Protein sequence: 
^ Protein Accession tt; NPJM0OS6.1 

! 11 21 31 41 51 

I I ! I I 1 

MVPDTACVLL LTLAALGASG QGQSPLGSDL GPQMLRELQE TNAALQDVKD WLRQQVREIT SO 
FLKNTVMECD ACGMQQSVRT GLPS VRPLLH CAPGFCFPGV ACIQTESGGR CGPCPAGFTG 120 

20 NGSHCTDVNE CNAHPCFPRV RCINTSPGFR CEACPFGYSG PTHQG VGLAF AKANKQVCTD 1 80 
[NECETGQHN CVPNSVCINT RGSFQCGPCQ PGFVGDQASG CQRGAQRFCP DGSPSECHEH 240 
ADCVLERDGS RSCVCRVGWA GNGILCGRDT DLDGFPDEKL RCPEPQCRKD NCVTVPNSGQ 300 
ED VDRDGIGD ACDPDADGDG VPNEKDNCPL VRNPDQRNTD EDKWGD ACDN CRSQKNDDQK 360 
DTDQDGRGDA CDDDIDGDRI RNQADNCPRV PNSDQKDSDG DG1GDACDNC PQKSNPDQAD 420 

25 VDHDFVGDAC DSDQDQDGDG HQDSRDNCPT VPNSAQEDSD HDGQGDACDD DDDNDGVPDS 4S0 
RDNCELVPNP GQEDADRDGV GDVCQDDFDA DKVVDKIDVC PENAEVTLTD FRAFQTVVLD 540 
PEGDAQIDPN WVVLNQGREI VQTMNSDPGL AVOYTAFNGV DFEGTFHVNT VTDDDYAGF1 600 
rGYQDSSSI Y VVMWkQMI J'J' V j f <A V iH LKA\ ^SKiFGEQLRNALWHTGDT 660 
ESQVR1XWKD PRNVGWKDKKSYRWFLQHRP QVGYIRVRFYEGPELVADSN VVLDTTMRi'iG 720 

30 RLG VFCFSQE NISWANLRYR CNDTIPEDYE THQLRQA 

Seq ID NO: 1 1 9 DNA sequence 
Nucleic Acid Accession ft NMM421 1 

^ j_ Coding sequence: 157-1479 (underlined sequences correspond to start and slop codons) 

i II 21 31 41 51 
I I I I 1 I 

GGGACAGGGC TGAGGATGAG GAGAACCCTG GGGACCCAGA AGACCGTGCC TTGCCCGGAA 60 
GTCCTGCCTG TAGGCCTGAA GGACTTGCCC TAACAGAGCC TCAACAACTA CCTGGTGATT 120 
40 CCTACTTCAG CCCCTTGGTG TGAGCAGCTT CTCAACATGA ACTACAGCCT CCACTTGGCC 1 80 
TTCGTGTGTC TGAGTCTCTT CACTGAGAGG ATGTGCATCC AGGGGAGTCA GTTCAACGTC 240 
GAGGTCGGCA GAAGTGACAA GCTTTCCCTG CCTGGCTTTG AGAACCTCAC AQCAGG ATAT 300 
AACAAATTTC TCAGGCCCAA TTTTGGTGGA GAACCCGTAC AGATAGCGCT GACTCTGGAC 360 
ATTGCAAGTA TCYCTAGCAT TTCAGAG ACT AACATGGACT ACACAGCCAC CATATACCTC 420 

45 coACAGCncr ggatggacga GCGGcrrocnro 'ittgaaggga acaagagctt cactctggat mo 

GCCCGCCTGCi TGGAGTTCCT CTGGOTGCCA GATACTTACA TTGTGGAGTC CAAGAAGTCC 540 
TTCCTCCATO AAGTCACTGT GGGAA ACAGG CTCATCCGCC TCTTCTCCAA TGGCACGGTC 500 
CTGTATGCCC TCAGAATCAC GACAACTGTT GCATGTAACA TGGATCTGTC TAAATACCCC 660 
ATGGACACAC AGACATGCAA GTTGCAGCTG GAAAGCTGGG GCTATGATGG AAATGATGTG 720 

5 0 GAGTTCACCT GGCTG AG AGG GA ACGACTCT GTGCGTGGAC TGGAACACCT GCGGCTTGCT 780 
CAGTACACCA TAGAGCGGTA TTTCACCTTA GTCACCAGAT CGCAGCAGGA GACAGGAAAT 840 
TACACTAGAT TGGTCTTACA GTTTGAGCTT CGGAGGAATG TTCTGTATTT CATTTTGGAA 900 
ACCTACGTTC CTTCCACTTT CCTGGTGGTG TTGTCCTGGG TTTCATTTTG GATCTCTCTC 960 
GATTCAGTCC CTGC A AGAAC CTGCATTGGA GTGACGACCG TGTTATCAAT GACCACACTG 1 020 

5 5 ATGATCGGGT CCCGCACTTC TCTTCCCAAC ACCAACTGCT TCATCAAGGC CATCG ATGTG 1080 
TACCTGGGGA TCTGCTTTAG CTTTGrGTTT GGGGCCTTGC TAGAATATGC AGTTGCTCAC 1 140 
TACAGTTCCT TACAGCAGAT GGCAGCCAAA GATAGGGGGA CAACAAAGGA AGTAGAAGAA 1200 
GTCAGTATTA CTAATATCAT CAACAGCTCC ATCTCCAGCT TTAAACGGAA GATCAGCrTT 1260 
GCCAGCATTG AAATTTCCAG CGACAACGTT GACTACAGTG ACTTGACAAT GAAAACCAGC 1320 

60 GACAAGTTCA AGTTTGTCTT CCGAGAAAAG ATGGUCAGGA TTGTTGATTA TTTCACAATT 1380 
CAAAACCCCA GTAATGTTGA TCACTATTCC A AACTACTGT TTCCTTTGAT TTTTATGCTA 1440 
GCCAATGTAT TTTACTGGGC ATACTACATG TATITTTGAG TCAATGTTAA ATTTCTTGCA 1500 
TGCCATAGGT CTTCAACAGG ACAAG ATAAT GATGTAAATG GTATTTTAGG CCAAGTGTGC 1560 
. ACCCACATCC AATGGTGCTA CAAGTG ACTG AAATAATATT TG AGTCTTTC TGCTCAAAGA 1 620 

65 ATGAAGCTCC AACCATTGTT CTAAGCTGTG TAGAAGTCCT AGCATTATAG GATCTTGTAA 1680 
TAGAAACATC AGTCCATTCC TCTTTCATCT TAATCAAGGA CATTCCCATG GAGCCCAAGA 1740 
TTACAAATGT ACTCAGGGCT GTTTATTCGG TGGCTCCCTG GTTTOCATTT ACCTCATATA ! 800 
AAGAATGGGA AGGAGACCAT TGGGTAACCC TCAAGTGTCA GAAGTTGTTT CTAAAGTAAC 1860 
TATACATGTT TTTTACTA AA TCTCTGCAGT GCTTATA AAA TACATTGTTG CCTATTTAGG 1 920 

70 GAGTAACATTTTCTAGTTTTTGTTTCTGGTTAAAATGAAATATGGGCTTATGTCAATTCA 1980 
TTGGAAGTCA ATGCACTAAC TCAATACCAA GATGAGTTTT TAAATAATGA ATATTATTTA 2040 
ATACCACAAC agaattatcc ccaatttcca ATAAGTCCTA TCATTGAAAA TTCAAATATA 2100 
AGTGAAGAAA AAATTAGTAG ATCAACAATC TAAACAAATC CCTCGGTTCT AAGATACAAT 2160 
GGATTCCCCA TACTGGAAGG ACTCTGAGGC TTTATTCCCC CACTATGCAT ATCTTATCAT 2220 

75 TTTATTATTA TACACACATG CATCCTAAAC TATACTAAAG CCCTTTTCCC ATGCATGGAT 2280 
GGAAATGGAA GATTTTTTTG TAACTTGTTC TAGAAGTCTT AATATGGGCT GTTGCCATGA 2340 
AGGCTTGCAG AATTGAGTCC ATTTTCTAGC TQCCTTTATT CACATAGTGA TGGGGTACTA 2400 
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AAAGTACTGG GTTGACTCAG AGAGTCGCTG TCATTCTGTC ATTGCTGCTA CTCTAACACT 2460 
GAGCAACACT CTCCCAGTGG CAGATCCCCT GTATCATTCC AAGAGGAGCA TTCATCCCTT 2520 
TGCTCTAATG ATCAGGAATG ATGCTTATTA GAAAACAAAC TGCTTGACCC AGGAACAAGT 2580 
GGCTTAGCTT AAGTAAACTT GGCTTTGCTC AGATCCCTGA TGCTTCCAGC TGGTCTGCTC 2640 
5 TGAGTGGCTT ATCCCGCATG AGCAGGAGCG TGCTGGCCCT GAGTACTGAA CTTTCTGAOT 2700 
AACAATGAGA CACGTTACAG AACCTATGTT CAGGTTGCGG GTGAGCTGCC CTCTCCAAAT 2760 
CCAGCCAGAG ATGCACATTC CTCGGCCAGT CTCAGCCAAC AGTACCAAAA GTGATTTTTG 2820 
AGTGTGCCAG GGTAAAGGCT TCCAGTTCAG CCTCAGTTAT TTTAGACAAT CTCGCCATCT 2680 
TTAATTTCTT AGCTTCCTGT TCTAATAAAT GCACGGCTTT ACCTTTCCTG TCAGAAATAA 2940 

1 0 ACCA AGGCTC TAAAAGATGA TTTCCCTTCT GTAACTCCCT AGAGCCACAG GTTCTCATTC 300O 
CTTTTCCCAT TATACTTCTC ACAATTCAGTTTCTATGAGT TTGATCACCT G ATTTTnTA 3060 
ACAAAATA'i'T TC'l'AACGGGA ATGGGTGGGA GTGCTGGTGA AAAGAGATGA AATGTGGTTG 3120 
TATGAGCCAA TCATATTTGT G ATTTTTTA A AAA AAGTTTA AAAGGAAATA TCTGTTCTG A 3 1 SO 
AACCCCACTT AAGCATTGTT TTTATATAAA AACAATGATA AAGATGTGAA CTGTGAAATA 3240 

15 AATATACCAT ATTAGCTACC CACC 

Protein Accession^ NP 055026.1 

20 1 11 21 31 41 51 

MNYSLHLAFV CLSLFTERMCIQGSQFNVEV GRSDKLSLPG FENLTAGYNK FLRPNFGGEP 60 
VQIALTLDIA SISSISESNM DYTATIYLRQ RWMDQRLVFE GNKSFTLDAR LVEFLWVPDT 120 
YIVESKKSFL HEVTVGNRLI RLFSNGTVLY ALFJTTTVAC NMDLSKYFMD TQTCKLQLES 180 

25 WGYDGNDVEF TWLRGNDSVR GLEHLRLAQY TIERYFTLVT RSQQETGNYT RLVLQFELRR 240 
NVLYFILETY VPSTFLVVLS WVSFW1SLDS VPARTCIGVT TVLSMTTLMI GSRTSLPNTN 300 
CFIKAIDVYL GICFSFVFGA LLEYAVAHYS SLQQMAAK ! )R GTTK EVEEVS ITNIINSSiS 360 
SFKRKISFAS IEISSDNVDY SDLTMKTSDK FKFVFRFXMG RIVDYF'HQN PSNVDHYSKL 420 

^ LFPLIFMLAN VFYWAYYMYF 

Scq ID NO: 121 DNA sequence 

Nucleic Acid Accession ft NM.001 854 

Coiling sequence; 163-5582 (underlined sequences correspond to start and stop codons) 

35 1 11 21 31 41 51 
I I I I I I 

AACCATCAAA TTTAGAAGAA AAAGCCCTTT GACTTnTCC CCCTCTCCCT CCCCAATGGC 60 
TGTGTAGCAA ACATCCCTGG CGATACCTTG GAAAGGACGA AGTTGGTCTG CAGTCGCAAT 120 
TTCGTGGGTT G AGTTCACAG TTGTGAGTGC GGGGCTCGGA G ATG GAGCCG TGGTCCTCTA 1 80 

40 GGTGGAAAAC GAAACGGTGG CTCTGGGATT TCACCGTAAC AACCCTCGCA TTGACCTTCC 240 
TCTTCCAAGC TAGAGAGGTC AGAGGAGCTG CTCCAGTTGA TGTACTAAAA GCACTAGATT 300 
TTCACAATTC TCCAGAGGGA ATATCAAAAA CAACGGGATT TTGCACAAAC AGAAAGAATT 360 
CTAAAGGCTC AGATACTGCT TACAGAGTTT CAAAGCAAGC ACAACTCAGT GCCCCAACAA 420 
AACAGTTATT TCCAGGTGUA ACTTTCCCAG AAGACTTTTC AATACTAT1T ACAGT AAAAC ISO 

45 CAAAAAAAGG AATTCAGTCT TTCCTTTTAT (ffATATATAA TGAGCATGGT ATTCAGCAAA 540 
TTGGTGTTG A GGTTGGG AGA TCACCTGTTT TTCTGTTTGA AG ACCACACT GGAA AACCTG 600 
CCCCAGAAGA CTATCCCCTC TTCAGAACTG TTAACATCGC TGACGGGAAG TGGCATCGGG 660 
TAGCAATCAG CGTGGAGAAG AAAACTGTGA CAATGATTGT TGATTGTAAG AAGAAAACCA 720 
CGAAACCACT TGATAGAAGT GAGAGAGCAA TTGTTGATAC CAATGGAATC ACGGTTTTTG 780 

50 GAACAAGGAT TTTGGATGAA GA AG'mTTG AGGGGGACAT TCAGCAGTTT TTGATCACAG 840 
GTG ATCCCA A GGCAGCATAT GACTACTGTG AGCATTATAG TCCAGACTGT GACTCTTCAG 300 
CACCCAAGGC TGCTCAAGCT CAGGAACCTC AGATAGATGA GTATGCACCA GAGGATATAA 960 
TCGAATATGA GTATGAGTAT GGGGAAGCAG AGTATAAAGA GGCTGAAAGt GTAACAGAGG 1020 
GACCCACTGT AACTGAGGAG ACAATAGCAC AGACGGAGGC AAACATCGTT GATGATTTTC !0S0 

5 5 AAGA ATACAA CTATGGAACA ATGGAAAGTT ACCAG AC AGA AGCTCCTAGG CATGTTTCTG 1140 
QGACAAATGA GCCAAATCCA GTTGAAGAAA TATTTACTGA AGAATATCTA ACGGGAGAGG 1200 
ATTATGATTC CCAGAGGAAA AATTCTGAGG ATACACTATA TGAAAACAAA GAAATAGACG 1260 
GCAGGGATTC TGATCTTCTG GTAGATGGAG ATTTAGGCGA ATATGATTTT TATGAATATA 1320 
AAGAATATGA AGATAAACCA ACAAGCCCCC CTAATGAAGA ATTTGGTCCA GGTGTACCAG 1380 

60 CAGAAACf GA TATTACAGAA ACAAGCATAA ATGGCCATGG TGCATATGGA GAGAAAGGAC 1440 
AGAAAGGAGA ACCAGCAGTG G1TGAGCCTG GTATGCTTGT CGAAGGACCA CCAGGACCAG 1500 
CA fATTATG GGTCCTC CTACA C( 5 v 1 CCC > -> 

ACCCTGGCGA TAGGGGCCCC CCAGGACGTC CTGGCTTACC AGGGGCTGAT GGTCTACCTG 1620 
GTCCTCCTGG TACTATGTTG ATGTTACCGT TCCGTTATGG TGGTGATGGT TCCAAAGGAC 1680 

65 CAACCATCTC TGCTCAGGAA GCTCAGGCTC AAGCTATTCT TCAG CAGGCT CGGATTGCTC 1740 
TGAGAGGCCC ACCTGGCCCA ATGGGTCTAA CTGGAAG ACC AGGTCCTGTG GGGGGGCCTG 1 800 
GTTCATCTGG GGCCAAAGGT GAGAGTGGTG ATCCAGGTCC TCAGGGCCCT CGAGGCGTCC 1860 
AGGGTCCCCC TGGTCCAACG GGAA AACCTG GAAAAAGGGG TCGTCCAGGT GCAGATGGAG 1920 
GAAGAGGAAT GCCAGGAGAA CCTGGGGCAA AGGGAGATCG AGGGTTTGAT GGACTTCCGG 1980 

70 GTCTGCCAGG TGACAAAGGT CACAGGGGTG AACGAGGTCC TCAAGGTCCT CCAGGTCCTG 2040 
CTGGTGATGA TGGAATGAGG GGAGAAGATG GAGAAATTGG ACCAAGAGGT CTTCCAGGTG 2100 
AAGCTGGCCC ACGAGGTTTG CTGGGTCCAA GGGGAACTCC AGGAGCTCCA GGGCAGCCTG 2160 
GTATGGCAGG TGTAGATGGC CCCCCAGGAC CAAAAGGGAA CATGGGTCCC CAAGGGG AGC 2220 
CTGGGCCTCC AGGTCAACAA GGGAATCCAG GACCTCAGGG TCTTCCTGGT CCACAAGGTC 22E0 

75 CAATTGGTCC TCCTGGTGAA AAAGGACCAC AAGGAAAACC AGGACTTGCT GGACTTCCTG 2340 
GTGCTGATGG GCCTCCTGGT CATCCTGGGA AAGAAGGCCA GTCTGGAGAA AAGGGGGCTC 2400 
TGGGTCCCCC TGGTCCACAA GGTCCTATTG GATNNCCGGG CCCCCGGGGA GTAAAGGGAG 2460 
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CAGATGGTGT CAGAGGTCTC AAGGGATCTA AAGGTGAAAA GGGTGAAGAT GGTTTTCCAG 2520 
GATTCAAAGG TGACATGGQT CTAAAAGGTO ACAGAGGAGA AGTTGGTCAA ATTGGCCCAA 2580 
GAGGGNAAGA TGGCCCTGAA GGACCCAAAG GTCGAGCAGG CCCAACTGGA GACCCAGGTC 2640 
CTTCAGGTCA AGCAGGAGAA AAGGGAAAAC TTGGAGTTCC AGGATFACCA GGATATCCAG 2700 
5 GAAGACAAGG TCCAAAGGGT TCCACTGGAT TCCCTGGGTT TCCAGGTGCC AATGGAGAGA 276D 
AAGGTGCACG GGGAGTAGCT OGCAAACCAG GCCCTCGGGG TCAGCGTGGT CCAACGGGTC 2820 
CTCGAGGTTC AAGAGGTGCA AG AGGTCCCA CTGGG AAACC TGGGCCAAAG GGCACTTCAG 2880 
GTGGCGATGG CCCTCCTGGC CCTCCAGGTG AAAGAGGTCC TCAAGGACCT CAGGGTCCAG 2940 
TTGGATTCCC TGGACCAAAA GGCCCTCCTG GACCACCAGG AAGGATGGGC TGCCCAGGAC 3000 

1 0 ACCCTGGGCA ACGTGGGGAG ACTGGATTTC AAGGCAAG AC CGGCCCTCCT GGGCCAGGGG 3060 
GAGTGGTTGG ACCACAGGGA CCAACCGGTG AGACTGGTCC AATAGGGGAA CGTGGGTATC 3 1 20 

,1 GGCCCTCCT GG'l . jTCITCCTGG 1 CTG< V AAAGAAGO IS I 

CAAAGGGTGA TCCAGGTCCT CAAGGTATCT CAGGGAAAGA TGGACCAGCA GGATTACGTG 3240 
'GTTTCCCAGG GGAAAGAGGT CTTCCTGGAG CTCAGGGTGC ACCTGGACTG AAAGGAGGGG 3300 

1 5 AAGGTCCCCA GGGCCCACCA GGTCCAGTTG GCTCACCAGG AGAACGTGGG TCAGCAGGTA 33 60 
CAGCTGGCCC AATTGGTTTA CGAGGGCGCC CGGGACCTCA GGGTCCTCCT GGTCCAGCTG 3420 
GAGAGAAAGG TGCTCCTGGA QAAAAAGC-TC CCCAAGGGCC TGCAGGQAGA GATGGAGTTC 3480 
AAGGTCCTGT TGGTCTCCCA GGGCCAGCTG GTCCTGCCGG CTCCCCTGGG GAAGACGGAG 3540 
ACAAGGGTGA AATTGGTGAG CCGGGACAAA AAGGCAGCAA GGGTGGCAAG GGAGAAAATG 3600 

20 GCCCTCCCGG TCCCCCAGGT CTTCAAGGAC CAGTTGGTGC CCCTGGAATT GCTGGAGGTG 3650 
ATGGTGAACC AGGTCCTAGA GGACAGCAGG GGATGTTTGG GCAAAAAGGT GATGAGGGTG 3720 
CCAGAGGCTT CCCTGGACCT CCTGGTCCAA TAGGTCTTCA GGGTCTGCCA GGCCCACCTG 37B0 
GTGAAAAAGG TGAAAATGGG GATGTTGGTC CATGGGGGCC ACCTGGTCCT CCAGGCCCAA 3840 
GAGGCCCTCA AGGTCCCAAT GGAGCTGATG GACCACAAGG ACCCCCAGGT TCTGTTGGTT 3900 

25 CAGTTGGTGG TGTTGGAGAA AAGGGTGAAC CTGGAGAAGC AGGAAACCCA GGGCCTCCTG 3960 
GGGAAGCAGG TGTAGGCGGT CCCAAAGGAG AAAGAGGAGA GAAAGGGGAA GCTGGTCCAC 4020 
CTGGAGCTGC TGGACCTCCA GGTGCCAAGG GGCCGCCAGG TGATGATGGC CCTAAGGGTA 4080 
ACCCGGGTCC TGTTGGTTTT CCTGGAG ATC CTGGTCCTCC TGGGGAACTT GGCCCL GCAG 4140 
GTCAAGATGG TGTTGGTGOT GACAAGGGTG AAGATGGAGA TCCTGGTCAA CCGGGTCCTC 4200 

3 0 CTGGCCCATC TGGTG AGGCT GGCCCACCAG GTCCTCCTGG AAAACG AGGT CCTCCTGGAG 4260 

CTGCAGGTGC AGAGGGAAGA CAAGGTGAAA AAGGTGCTAA GGGGGAAGCA GGTGCAGAAG 4320 
GTCCTCCTGG AAAAACCGGC CCAGTCGGTC CTCAGGGACC TGCAGGAAAG CCTGGTCCAG 43B0 
AAGGTCTTCG GGGCATCCCT GGTCCTGTGG GAGAACAAGG TCTCCCTGGA GCTGCAGGCC 4440 
AAGATGGACC ACCTGGTCCT ATGGGACCTC CTGGCTTACC TGGTCTCAAA GGTGACCCTG 4500 

3 5 GCTCCAAGGG TGAAAAGGGA CATCCTGGTT TAATTGGCCT GATTGGTCCT CCAGGAGAAC 4560 
AAGGGGAAAA AGGTGACCGA GGGCTCCCTG GAACTCAAGG ATCTCCAGGA GCAAAAGGGG 4620 
ATGGGGGAAT TCCTGGTCCT GCTGGTCCCT TAGGTCCACC TGGTCCTCCA GGCTTACCAG 4680 
GTCCTCAAGG CCCAAAGGGT AACAAAGGCT CTACTGGACC CGCTGGCCAG AAAGGTGACA 4740 
GTGGTCTTCC AGGGCCTCCT GGGCCTCCAG GTCCACCTGG TGAAGTCATT GAGCCTTTAC 4800 

40 CAATCTTGTC CTCCAAAAAA ACGAGAAGAC ATACTGAAGG CATGCAAGCA GATGCAGATG 4860 
ATAATATTCT TGATTACTCG GATGGAATGG AAGAAATATT TGGTTCCCTC AATTCCCTGA 4920 
AACAAGACAT CGAGCATATG AAATTTCCAA TGGGTACTCA GACCAATCCA GCCCGAACTT 4980 
GTAAAGACCT GCAACTCAGC CATCCTGACT TCCCAGATGG TGAATATTGG ATTGATCCTA S04O 
ACCAAGGTTG CTCAGGAGAT TCCITCAAAG TTTACTGTAA TTTCACATCT GGTCCTCAOA MOO 

45 CTTGCATTTA TCCAGACA A A AAATCTGAGti GAGTAAGAAT TTCATCATGG CCAAAGHA G A - 6 
AACCAGGAAG TTGGTTTAGT GAATTTAAGA GGGGAAAACT GCTTTCATAC TTAGATGTTG 5220 
AAGGAAATTC CATCAATATG GTGCAAATGA CATTCCTGAA ACTTCTGACT GCCTCTGCTC 5280 
GGCAAAATTT CACCTACCAC TGTCATCAGT CAGCAGCCTG GTATGATGTG TCATCAGGAA 5340 
GTTATGACAA AGCACTTCGC TTCCTGGGAT CAAATGATGA GGAGATGTCC TATGACAATA 5400 

5 0 ATCCTTTTAT CA A AACACTG TATG ATGGTT GTACGTCCAG AAAAGGCTAT G AAAAAACTG 5460 
TCATTGAAAT CAATACACCA AAAATTGATC AAGTACCTAT TGTTGATGTC ATGATCAGTG 5520 
ACTTTGGTG A TCAGA ATCAG AAGTTCGGAT TTG A AGTTGG TCCTGTTTGT TTTCTTGGCT. 5580 
AAGATTAAGA CAA AGAACAT ATCAAATCA A CAGAAAATGT ACCTTGGTGC CACCAACCCA 5640 
TTTTGTGCCA CATGC AAGTT TTGAATAAGG ATGTATGG AA AAC AACGCTG CATATACAGG 5700 

55 TACCATTTAG GAAATACCG A TGCCTTTGTG GGGGC AG AAT CACAGACAAA AGCTTTGAAA 5760 
ATCATAAAGA TATAAGTTGG TGTGGCTAAG ATGGAAACAG GGCTGATTCT TGATTCCCAA 5820 
TTCTCAACTC TCCTTTTCCT ATTTGAATTT CTTTGGTGCT GTAGAAAACA AAAAAAGAAA 5880 
AATATATATT CATAAAAAAT ATGGTGCTCA TTCTCATCCA TCCAGGATGT ACTAAAACAG 5940 
TGTGTTTAAT AAATTGTAAT TATTTTGTGT ACAGTTCTAT ACTGTTATCT GTGTCCATTT 6000 

60 CCAAAACTTG CACGTGTCCC TGAATTCCGC TGACTCTAAT TTATG AtJGAT GCCGAACTCT 6060 
GATGGCAATA ATATATGTAT TATGAAAATG AAGTTATGAT "JTCCGATGAC CCTAAGTCCC 6120 
TTICTTTGGT TAATGATGAA ATTCCTTTGT GTGTGTTT 

Seq ID NO; 122 Protein sequence: 
65 Protein Accession ft NP.001B4S 



1 11 21 31 41 51 

70 MEPWSSRWKT KK.WLWDFTVT TLAWFLFQA REVEGAAPVD VLKALDFHNS PEG1SKTTGF 60 
CTNKKNSKGS DTAYRVSK.QA QLSAPTKQLF PGGTFFEDFS IL.FTVKPKKG IQSFLLSIYN 120 
EHGIQQIGVE VGRSPVFLFE DHTGKPAPED YPLFRTVNIA DGKWHRVAIS VEKKTVTM1V 1 80 
□CKKKTTKPL DRSERAIVDT NGITVFGTRI LDEEVFEGDI QQFL1TGDPK AAYDYCEHYS 240 
PDCDSSAPKA AQAQEPQIDE YAPEDIIEYD YEYGEAEYKE AESVTEGPTV TEETIAQTEA 300 

75 NTVDDFQEYN YGTMESYQTE APRHVSGTNE PNPVEEIFTE EYLTGEDYDS QRKNSEDTLY 360 
ENKE1DGRDS DLLVDGDLGE YDFYEYKEYE DKPTSPPNEE FGPGVPAETD ITETSINGHG 420 
AYGEK.GQKGE PAVVEPGMLV EGPPGPAGPA G1MGPPGLQG PTGPPGDPGD RGPPGRPGLP 480 
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GADOLPOPPG TMLMLPFRYG GDGSKGPTIS AQEAQAQAIL QQARJALRGP PGPMGLTGRP 540 
GPVGGPGSSG AKGESGDPGP QGPRGVQGPP GPTGKPGKRG RPGADGGRGM PGEPGAKGDR 600 
GFDGLPGLPG DKGHRGERGP QGPPGPPGDD GMRGEDGEIG PRGLPGEAGP RGLLGPRGTP 660 
GAPGQPGMAG VDGPPGPKGN MGPQGEPGPP GQQGNPGPQG LPGPQGPIGP PGEKGPQGKP 720 
5 GLAGLPGADG PPGHPGKEGQ SGEKGALGPP GPQGP1GXPG PRGVKGADGV RGLKGSKGEK 7S0 
GEDGFPGFKG DMGLKGDRGE VGQIGPRGXD GPEGPKGRAG PTGDPGPSGQ AGEKGKXGVP 840 
GLPGYPGRQG PKGSTGFPGF PGANGEKGAR GVAGKPGPRG QRGPTGPRGS RGAROPTGKP 900 
GPKGTSGGDG PPGPPGERGP QGPQGPVGFP GPKGPPGPPG RMGCPGHPGQ RGETGFQGKT 960 
GPPGPGG WO FQGPTGETGP IGERGYPGPP GPPGEQGLPG AAGKEGAKGD PGPQGISGKD 1020 

1 0 GPAGLK fiJ-J'G ERGLPGAQGA PGLKGGEGPQ GPPG? VGSI'G ERGSAGTAGP JGUiGKPGPQ J080 
GPPGPAGEKG APGEKGPQGP AGRQGVQGPV GLPGPAGPAG SPGEDGDKGE IGEPGQKGSK ] 140 
GGfCGEN 1'1'G PPG1 QGPVGA PGIAGGDGEP GPRGQQGM1-G QKGDEG ARGF PGPFGPIQLQ 1200 
GLFGPPGEKG ENGDVGPWGP PGPPGPRGPQ GPNGADGPQG PPGSVGSVGG VGEKGEPGEA 1260 
GNPGPPGEAG VGGPKGERGE KGEAGPPGA A GPPG AKGPPG DDGPKGNPGP VGFPGDPGPP 1 320 

1 5 GEU5PAGQDG VGGDtCGEDGD PGQPGPPGPS GEAGPPGPPG KRGPPGAAGA EGRQGEKGAK 1 380 
GEAGAEGPPG KTGPVGPQGP AGKPGPEGLR CIPGPVGEQG LFGAAGQDGP PGPMGPPGLP 1440 
GLKGDPGSKG EKGHPGLIGL 1GPPGEQGEK GDRGLPGTQG SPGAKGDGGI PGPAGFLGPP 1500 
GPPGLPGPQG PKGNKGSTGP AGQKGDSGLP GPPGPPGPPG EVIQPLPILS SKKTRRHTEG 1S60 
MQADADDNIL DYSDGMEEIF GSLNSLKQDI EHMKFPMGTQ TNPARTCKDL QLSHPDFPDG 1620 

20 EYWIDPNQGC SGDSFKVYCN FTSGGETCrY PDKKSEGVRI SSWPKEKPGS WFSEFKKGKL !680 

LSYLDVEGNS INMVQMTFLK LLTASARQKF TYHCHQSAAW YDVSSGSYDK ALRFLGSNDE 1740 
EMSYDNNPFI KTLYDGCTSR KGYEKTVIEI NTPKIDQVPI VDVMISDFGD QNQKFGFEVG 1 800 
PVCFLG 

25 Ssq ID NO: 1 23 DNA sequence 

Nucleic Acid Accession #: NM_01 5886 

Coding sequence: 485-1261 (underlined sequences correspond to start and stop codnns) 

1 11 21 31 41 51 

30 | | | | 1 I 

G AATTCCCCC CCCCCCCCCC TCACTTGGTG TGTCTATATG TCTGGCAGAC ATTATCAGCA 60 
CATTCTCTGT TGTTACCTGT GATTCATTTT TTCTTCACTC TCCAGGTGAA TTTCA ATTGC 1 20 
TGAAAATTTC CCACTGAAAA TATGCAGTAA TATATTTTOT GGTTCAGACA TTTGGGGCAA .180 
ATGGTTCACA TTCATTTTAG GGTTAGTGGT CATGCTGTTT ATTTTTCTCT GCTATACAAA 240 

35 GTTCCTCTTA GGGGTCTGCC TCATGACACT AAAAAATGAA TAGAGATTCT ACTGTAGGTT 300 
ATCTCCTAGG CTTG AGTTCA ACATTTGTTT GGATTTTTG A AG AA AGTCAA ATCAAGCAAT 360 
GCTCCCAAAT GATGTC1TTG TAAATTCATA CCCTCTGGCC CTATTTTTTT TCATAGACCC 420 
TAACTCTACC TTTCTGCTTT AAAGCAAAGT AAACTCGGTG GCCTCTTCTT CTCCACCCCT 480 
CAAAATGATA GCAATCTCTG CCGTCAGCAG TGCACTCCTG TTCTCCCTTC TCTGTGAAGC 540 

40 AAGTACCGTC GTCCTACTCA ATTCCACTGA CTCATCCCCG CCAACCAATA ATTTCACTGA 600 
TATTGAAGCA GCTCTGAAAG CACAATTAGA TTCAGCGGAT ATCCCCAAAG CCAGGCGGAA 660 
GCGCTAC ATT TCGCAG AATG ACATG ATCGC CATTCTTGAT TATCATAATC AAGTTCGGGG 720 
CAAAGTGTTC CCACCGGCAG CAAATATGGA ATATATGGTT TGGGATGAAA ATCTTGCAAA 780 
ATCCGCAGAG GCTTGGGCGG CTACTTGCAT 1TGGGACCAT GGACCTTCTT AC TTACTGAG S4C 

45 ATTTITGGGC CAAAATCTAT CTGTACGCAC TGGAAGATAT CGfl TATT 1'CCAGTJ Gl 900 
CAAGCCATGG TATGATGAAG TGAAAGATTA TGCTTTTCCA TATCCCCAGG ATTGCAACCC 960 
CAGATGTCCT ATGAGATGTT TTGGTCCCAT GTGCACACAT TATACGCAGA TGGTTTGGGC 1020 
CACTTCCAAT CGGATAGGAT GCGCAATTCA TGCTTGCCAA AACATGAATG TTTGGGGATC 1080 
TGTGTGGCGA CGTGCAQTTT ACTTGGTATG CAACTATGCC CCAAAGGGCA ATTGGATTGG 1 140 

5 0 AGAAGCACCA TATA AAGTAO GGGTACCATG TTCATCTTGT CCTCCA AGTT ATG GGGGATC 1200 
TTGTACTGAC AATCTGTGTT TTCCAGGAGT TACGTCAAAC TACCTGTACT GGTTTAAATA 1260 
AGTTTACCTT TTCCTCCAGG AAATATAATG ATTTCTGGGA ACATGOGCAT GTATATATAT 1320 
ATATGGAGAG AGAATTTTGC ACATATTATA CATATTTTGT GCTAATCTTG TTTTCCTCTT 1380 
AGTATTCCTT TGTATAAATT AGTGTTTGTC TAGCATGTTT GTTTAATCCT TTGGGAATTC 



1 11 21 31 41 51 
1 I I I I I 

M1AISAVSSA LLFSLLCEAS TVVLLNSTDS S1OTNNFTPJ EAALKAQLDS ADIPKARRKR 60 
Y1SQNDMIA1 LDYHNQVRGK VFPPAANMKY MVWDJ5NLAKS AEAWAATC1W DHGPSYLLRF 120 
LGQNLSVRTG RYRSILQLVK. PWYDEVKDYA FPYPQDCNPR CPMRCFCJPMC TI 1YTQMVWAT 1 80 
SNPJGCAIHA CQNMNVWGSV WRRAVYLVCN YAPK.GNWIGE APYKVGVFCS SCPPSYGGSC 240 
TDNLCFFGVT SNYLYWFK 

Seq ID NO: 125 DNA sequence 

Nucleic Acid Accession ft NM_0Q1793 

Coding sequence: 54-2543 (underlined sequences correspond to start and stop oodons) 
I 11 21 31 41 51 

GCGG AACACC GGCCCGCCGT CGCGGCAGCT GCTTCACCCC TCTCTCTGCA GCCATGGGGC 60 
TCCCTCGTGG ACCTCTCGCG TCTCTCCTCC TTCTCCAGGT TTGCTGGCTG CAGTGCGCGG 120 
CCTCCGAGCC GTGCCGGGCG GTCTTCAGGG AGGCTG AAGT GACCTTGGAG GCGGGAGGCG 1 80 
CGGAGCAGGA GCCCGGCCAG GCGCTGGGGA AAGTATTCAT GGGCTGCCCT GGGCAAGAGC 240 
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CAGCTCTGTT TAGCACTGAT A ATG ATGACT TCACTGTGCG GAATGGCGAG ACAGTCCAGG 300 
AAAGAAGGTC ACTGAAGGAA AGGAATCCAT TGAAGATCTT CCCATCCAAA CGTATCTTAC 360 
GAAGACACAA GAGAGATTGG GTGGTTGCTC CAATATCTGT CCCTGAAAAT GGCAAGGGTC 420 
CCTTCCCCCA □ AG ACTGAAT CAGCTCAAGT CTAATAA AG A TAG AGACACC AAGATTTTCT 480 
5 ACAGCATCAC GGGGCCGGGG GCAGACAGCC CCCCTGAGGG TGTCTTCGCT GTAGAGAAGG 540 
AGACAGGCTG GTTGTTGTTG AATAAGCCAC TGGACCGGGA GGAGATTGCC AAGTATGAGC 600 
TCTTTGGCCA CGCTGTGTCA GAGAATGGTG CCTCAGTGGA GGACCCCATG AACATCTCCA 660 
TCATCGTGAC CGACCAGAAT GACCACAAGC CCAAGTTTAC CCAGGACACC TTCCGAGGGA 720 
GTGTCTTAGA GGGAOTCCTA CCAGGTACTT CTGTGATGCA GGTGACAGCC ACAGATGAGG 780 

1 0 ATGATGCCAT CTACACCTAC AATGGGGTGG TTGCTTACTC CATCCATAGC CAAGAACCAA 840 
AOGACCCACA CGACCTCATG TTCACAATTC ACCGGAGCAC AGGCACCATC AGCGTCATCT 900 
CCAGTGGCCT GQACCGGGAA AAAGTCCCTG AGTACACACT GACCATCCAG GCCACAGACA 960 
TGGATGGGGA CGGCTCCACC ACCACGGCAG TGGCAGTAGT GGAGATCCTT GATGCCAATG 1020 

, ACAATGCTCC CATGTTTGAC CCCCAGAAGT ACG AGGCCCA TGTGCCTGAG AATGCAGTGG 1 080 

1 5 GCCATGAGGT GCAGAGGCTG ACGGTCACTG ATCTGGACGC CCCCAACTCA CCAGCGTGGC 1140 
GTGCCACCTA CCTTATCATG GGCGGTGACG ACGGGGACCA TTTTACCATC ACCACCCACC 1200 
CTGAGAGCAA CCAGGGCATC CTGACAACCA GGAAGGGTTT GGATTTTGAG GCCAAAAACC 1260 
AGCACACCCT GTACGTTGAA GTGACCAACG AGGCCCCTTT TGTGCTGAAG CTCCCAACCT 1320 
CCACAGCCAC CATAGTGGTC CACGTGGAGG ATGTGAATGA GGCACCTGTG TTTGTCCCAC 1 3B0 

20 CCTCCAAAGT CGTTGAGGTC CAGGAGGGCA TCCCCACTGG GGAGCCTGTG TGTGTCTACA 1440 
CTGCAGAAGA CCCTGACAAG GAGAATCAAA AGATCAGCTA CCGCATCCTG AGAGACCCAG 1500 
CAGGGTGGCT AGCCATGGAC CCAGACAGTG GGCAGGTCAC AGCTGTGGGC ACCCTCGACC 1560 
GTGAGGATGA GCAGTTTGTG AGGAACAACA TCTATGAAGT CATGGTCTTG GCCATGGACA 1620 
ATGG AAGCCC TCCCACCACT GGCACGGGAA CCCTTCTGCT AACACTGATT G ATGTCAACG 1680 

25 ACCATGGCCC AGTCCCTGAG CCCCGTCAGA TCACCATCTG CAACCAAAGC CCTGTGCGCC 1740 
ACGTGCTGAA CATCACGGAC AAGGACCTGT CTCCCCACAC CTCCCCTTTC CAGGCCCAGC 1800 
TCACAGATGA CTCAGACATC TACTGGACGG CAGAGGTCAA CGAGGAAGGT GACACAGTGG 1860 
TCTTGTCOCT CAAGAAGT'f C CTGAAGCAGG ATACATATGA CuTCiCACCTT TCTCTGTCTG 1920 
ACCATGGCAA CAAAGAGCAG L'TGACGGTGA TCAGGGCCAC 1 GTGTGCGAC TGCCATGGCC 1980 

3 0 ATGTCGAAAC CTGCCCTGGA CCCTGGAAAG GAGGTTTCAT CCTCCCTGTG CTGGGGGCTG 2040 
TCCTGGCTCT GCTGTTCCTC CTGCTGGTGC TGCTTTTGTT GGTGAGAAAG AAGCGGAAGA 2100 
TCAAGGAGCC CCTCCTACTC CCAGAAGATG ACACCCGTGA CAACGTCTTC TACTATGGCG 2160 
AAGAGGGGGG TGGCGAAGAG GACCAGGACT ATGACATCAC CCAGCTCCAC CG AGGTCTGG 2220 
AGGCCAGGCC GGAGGTGGTT CTCCGCAATG ACGTGGCACC AACCATCATC CCGACACCCA 22S0 

3 5 TGTACCGTCC TAGGCCAGCC AACCCAGATG AAATCGGCAA CTTTATAATT GAGAACCTGA 2340 
AGGCGGCTAA CACAGACCCC ACAGCCCCGC CCTACGACAC CCTCTTGGTG TTCGACTATG 2400 
AGGGCAGCGG CTCCGACGCC GCGTCCCTGA GCTCCCTCAC CTCCTCCGCC TCCGACCAAG 2460 
ACCAAGATTA CGATTATCTG AACGAGTGGG GCAGCCGCTT CAAGAAGCTG GCAGACATGT 2520 
ACGGTGGCGG GGAGGACGAC TAGGCGGCCT GCCTGCAGGG CTGGGGACCA AACGTCAGGC 2580 

40 CACAGAGCAT CTCCAAGGGG TCTCAGTTCC CCCTTCAGCT GAGGACTTCG GAGCTTGTCA 2640 
- GGAAGTGGCC GTAGCAACTT GGCGGAGACA GGCTATGAGT CTGACGTTAG AGTGGTTGCT 2700 
TCCTTAGCCT TTCAGGATGG AGGAATGTGG GCAGTTTGAC TTCAGCACTG AAAACCTCTC 2760 
CACCTGGGCC AGGGTTGCCT CAGAGGCCAA GTTTCCAGAA GCCTCTTACC TGCCGTAAAA 2820 
TGCTCAACCC TGTGTCCTGG GCCTGGGCCT GCTGTGACTG ACCTACAGTG GACTTTCTCT 2880 

45 CTGGAATGGA ACCTTCTTAG GCCTCCTGGT GCAACTTAAT TTTTTTTTTT AATGCTATCT 2940 

TCAAAACGTT AGAGAAAGTT CTTCAAAAGT GCAGCCCAGA GCTGCTGGGC CCACTGGCCG 3000 
TCCTGCATTT CTGGTTTCCA GACCCCAATG CCTCCCATTC GGATGGATCT CTGCGTTTTT 3060 
ATACTGAGTG TGCCTAGGTT GCCCCTTATT TTTTATTTTC CCTGTTGCGT TGCTATAGAT 3120 

^ g G AAGGGTGAG GACA ATCGTG TATATGTACT AGAACnTTT TATTAAAG AA A 

Scq ID NO: 1 26 Protein sequence: 
Protein Accession #: NPJ1017S4 



MGLPRGPLA5 LLLLQVCWLQ CAASEPCRAV FREAEVTLEA GGAEQEPGQA LGKVFMGCPG 60 
QEF ALFSTDN DDFTVRNGET VQERRSLKER NPLKIFPSFCR ILRRHKRDWV VAPISVPENG 120 
KGPFPQRLNQ LKSNKDRDTK IFYSITGPGA DSPPEGVFAV EKETGWLLLN KPLDREEIAK ISO 
„ YELFGHAVSE NGASVEDPMN ISIIVTDQND HKPKFTQDTF RGSVLEGVLP GTSVMQVTAT 240 

60 DEDDAIYTYN GVVAYSIHSQ EPKDPHDLMF TIHRSTGTIS VISSGLDREK VPEYTLTIQ A 300 

TDMDGDGST1' TAVA VVEI1 ,D AN DNAPMFDP QKYEAHVPEN A VGHE VQULT VTDLDAPNSP 360 
AWRATYUMG GDDGDHFTIT THPESNQOIL TTRKGLDFKA XNQfflXYVEV TNEAJ'FVLKL 420 
PTSTAT1VVH VEDVNEAPW V1TSKVVEVQ EGIPTGEI'VC VYTAEDPDKE NQK1SYRILR 480 
DPAGWLAMDP DSGQVTAVGT LDREDEQFVR NNIYEVMVLA MDNGSPPTTG TGTLLLTLID 540 

65 VNDHGPVPEP RQITICNQSP VRHVLNITDK DLSPHTSPFQ AQLTDDSDIY "WTAEVNEEGD 600 
TWLSLKKFL KQDTYDVHLS LSDHGNKEQL TVIRATVCDC HGHVETCPGP WKG GFILPVL 660 
GAVLALLFLL LVLLLLVRKK RKIKEPLLLP EDDTRDNVFY YGEEGGGEED QDYDITQLHR 720 
GLEARPEWL RNDVAPTIIP TPMYRPRPAN PDEIGNFIIE NLKAANTDPT APPYDTLLVF 780 

^ q DYEGSGSDAA SLSSLTSSAS DQDQDYDYLN EWGSRFKKLA DMYGGGEDD 

Seq ID NO; 127 DNA sequence 

Nucleic Acid Accession ft NM 003256. 1 

Coding sequence: 60-734 (underlined sequences correspond to start and stop codons) 

75 1 11 21 31 41 51 

1 I I I 1 I 

CCTGCTGGGG CCGTCCAGTC CCCCAGACCT CACAGGCTCA GTCGCGGATC TGCAGTGTCA 60 
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TGCCTGGGAG CCCTCGGCCC GCGCCAAGCT GGGTGCTGTT GCTGCGGCTG CTGGCGTTGC 120 
TGCGGCCCCC GGGGCTGGGT GAGGCATGCA GCTGCGCCCC GGCGCACCCT CAGCAGCACA 1 80 
TCTGCCACTC GGCACTTGTG ATTCGGGCCA AAATCTCCAG TGAGAAGGTA GTTCCGGCCA 210 
GTGCAGACCC TGCTGACACT GAAAAAATGC TCGGGTATGA AATCAAACAG ATAAAGATGT 300 
TCAAAGGGTT TGAGAAAGTC AAGGATGTTC AGTATATCTA TACGCCTTTT GACTCTTCCC 363 
TCTGTGGTGT GAAACTAGAA GCCAACAGCC AGAAGCAGTA TCTCTTGACT GGTCAGGTCC 420 
TCAGTGATGG AAAAGTCTTC ATCCATCTGT GCAACTACAT CGAGCCCTGG GAGGACCTGT 480 
CCTTGGTGCA GAGGGAAAGT CTGAATCATC ACTACCATCT GAACTGTGGC TGCCAAATCA 540 
CCACGTGCTA CACAGTACCC TGTACCATCT CGGCCCCTAA CGAGTGCCTC TGGACAGACT 600 
GGCTGTTGGA ACGA AAGCTC TATGGTTACC AGGCTCAGCA TTATGTCTGT ATGAAGCATG 660 
TTG ACGGCAC CTGCAGC'KiG TACCGGGGCC ACCTGCCTCT CAGG AAGG AG TTTGTTGACA 720 
TCGTTCAC5CC CTAOTAGGGA CCAGTGACCA TCACATCCCT TCAAGAOTCC TGAAGATCAA 780 
GCCAGTTCTC CTTCCCTGCA GAGCTTTGGC CATTACCACC TGACCTCTTG CTGCCAGCTA 840 
ATAAGAAGTG CCAAGTGGAC AGTCTGGCCA CTGTCAAGGC AGGGAAGGGG CCATGACTTT 900 
TCTGCCCTGC CCTCAGCCTG TTGCCCTGCC TCCCAAACCC CATTAGTCTA GCCTTGTAGC 960 
TGTTACTGCA AGTGTTTCTT CTGGCTTAGT CTGTTTTCTA AAGCCAGGAC TATTCCCTTT 1 020 
CCTCCCCAGG AATATGTGTT TTCCTTTGTC TTAATCGATC TGGTAGGGGA GAAATGGCGA 10SO 
ATGTCATACA CATGAGATGG TATATCCTTG CGATGTACAG AATCAGAAGG TGGTTTGACA 1 140 
GCATCATAAA CAGGCTGACT GGCAGGAATG AAAAAAAAAA AAAAAAAAA 

Seq ID NO: 128 Protein sequence: 
Protein Accession ft NP 003247. 1 

1 11 21 31 41 5i 
I I I I I I 

MPGSPRPAPS WVLLLRLLAL LRPPGLGEAC SCAPAHPQQH ICHSALV1RA KISSEKVVPA 60 
SADPADTEKM LRYEIKQ1KM FKGFEKVKDV QYIYTPFDSS LOG VKLEANS QKQYiXTGQV 1 20 
LSDGKVFHIL CNYIEI EDL v'Ql LNII II! I I 1 >! <i ^HLWTDISO 
WLLERKLYGY QAQIIYVCMKH VDCiTCSWYRG [U.PLRKEFVD IVQP 

Seq ID NO: 129 DNA sequence 

Coding sequence: 143-1591 (underlined seqacnccs correspond to start and stop codons) 

1 11 21 31 41 51 
I 1 I I I ! I 

CCACGCGTCC GCAATOAAGC CGAGTGAATG GGGGCTGAAT GTGCGAGTCC ATAGCTGAAG 60 
AGGAGCGCCA GATGGTGGAG GAATACACTT ATTTATGAAA CTGTCTTGAG TTCTTCTTGA 120 
ATTGCCAGTT TTCAGCCTCC TCATGCCTCC GTCTCCTTTA G ACG ACAGGG TAGTAGTGGC 1 80 
ACTATCTAGG CCCGTCCGAC CTCAGGATCT CAACCTTTGT TTAGACTCTA GTTACCTTGG 240 
CTCTGCCAAC CCAGGCAGTA ACAGCCACCC TCCTGTCATC GCCACCACCG TTGTGTCCCT 300 
CAAGGCTGCG AATCTGACGT ATATGCCCTC ATCCAGCGGC TCTGCCCGCT CGCTGAATTG 360 
TGGATGCAGC AGTGCCAGCT GCTGCACTGT GGCAACCTAC QACAAGGACA ATCAGGCCCA 420 
AACCCAAGCC ATTGCCGCTG GCACCACCAC CACTGCCATC GGAACCTCTA CCACCTGCCC 480 
TGCTAACCAG ATGGTCAACA ATAATGAGAA TACACKSCTUI CTAAOTOCAT CAAGTGGGGT 540 
GGGCAGCCCT GTGTCAGGGA CCCCCAAGCA GCTAGCCAGC ATCAAAATAA TCTACCCCAA 600 
TGACTTGGCA AAGAAGATGA CCAAATGCAG CAAGAGTCAC CTGCCGAGTC AGGGCCCTGT 660 
CATCATTGAC TGCAGGCCCT TCATGGAGTA CAACAAGAGT CACATCCAAG GAGCTGTCCA 720 
CATTAACTGT GCCGATAAGA TCAGCCGGCG GAG ACTGCAG CAGGGCAAGA TC ACTGTCCT 780 
AGACTTGATT TCCTGTAGGG AAGGCAAGGA CTCTTTCAAG AGGATCTTTT CCAAAGAAAT 840 
TATAGTTTAT GATGAGAATA CCAATGAACC AAGCCGAGTG ATGCCCTCCC AGCCACTTCA 900 
CATAGTCCTC GAGTCCCTGA AGAGAGAAGG CAAAGAACCT CTGGTGTTGA AAGGTGGACT 960 
TAGTAGTTTT AAGCAGA ACC ATGAAAACCT CTGTG ACA AC TCCCTCCAGC TCCAAGAGTG 1 020 
CCGGGAGGTG GGGGGCGGCG CATCCGCGGC CTCGAGCTTG CTACCTCAGC CCATCCCCAC 1080 
CACCCCTGAC ATCG AGAACG CTGAGCTCAC CCCCATCTTG CCCTTCCTGT TCCTTGGCAA 1140 
TGAGCAGGAT GCTCAGGACC TGGACACCAT GCAGCGGCTG AACATCGGCT ACGTCATCAA 1200 
CGTCACCACT CATCTTCCCC TCTACCACTA TG AG AAAGGC CTGTTCA ACT ACAAGCGGCT 1 260 
GCCAGCCACT GACAOCAACA AGCAG AACCT G CGGCAGTAC TTTGAAG AGG CTTTTGAGTT 1320 
CATTGAGGAA GCTCACCAGT GTGGGAAGGG GCTTCTCATC CACTGCCAGG CTGGGGTOTC 13S0 
CCGCTCCGCC ACCATCGTCA TCGCTTACTT GATGAAGCAC ACTCGGATGA CCATGACTGA 1440 
TGCTTATAAA TTTGTOAAAG GCAAACGACC AATTATCTCC CCA AACCTTA ACTTCATGOG 1500 
GCIAGTTGCTA GAG'ITCGAGG AACACCTAAA CAACGGTGTG ACACCGAGAA TCCTTACACC 1560 
AAAGCTGATG GGCGTGGAGA CCiGTTGTG TG A CAATGGTCT GGATGGAAAG GATTGCTGCT 1620 
CTCCATTAGG AGACAATGAG G AAGG AGG AT GGATTCTGGT TTTTTTTCTT TCTTTTTTTT 1680 
TTGTAGTTGG GAGTAAAGTT TGTGAATGGA AACAAACTTG GTTAAAC ACT TTATTTTTAA 1740 
CAAGTGTAAG AAG ACTATAC TTTTG ATGCC ATTGAGATTC ACCTTCCACA A ACTGGCCA A ! 800 
ATTA AGG AGG TTAA AG A AGT AATTTTTTTT AAGCCCAACC ATTAAAAATT TAATAC AACT 1 860 
TGGTTTCTCC CCCTTTTTCC TTTAAAGCTA NTTTGTAAAA GTTTATGAG 

Seq ID NO: 130 Protein sequence: 
Protein Accession #: NP_009138.1 

I II 21 31 41 51 
I 1 I i i I 

MPPSPLDDRV VVALSRPVRP QDLNLCLDSS YLGSANPGSN SHPPVIATTV VSLKAANLTY 60 
MPSSSGSARS LNCGCSSASC CTVATYDKDN QAQTQAIAAG TTTTAIGTST TCPANQMVNN 120 
NENTGSLSPS SGVGSPVSGT PKQLASIKII YPNDLAKKMT KCSKSHLPSQ GPVIIDCRPF ! 80 
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MEYNKSHIQG AVHiNCADKI SRRRLQQGKI TVLDLISCRE GKDSFKR1FS KEIIVYDENT 240 
NEPSRVMPSQ PLHIVLESLK REGKEPLVLK GGLSSFKQNH ENLCDNSLQL QECREVGGGA 300 
SAASSLLPQP IPTTPDIENA ELTPILPFLF LGNEQDAQDL DTMQRLNIGY VINVTTHLPL 360 
YHYEKGLFNY KRLPATDSNK QNLRQYFEEA FEFIEEAHQC GKGLLIHCQA GVSRS ATIVI 420 
5 AYLMKHTRMT MTDAYKFVKG KRPIISPNLN FMGQLLEFEE DLNNG VTFRI LTPKLMG VET 480 
VV 

Seq ID NO: 131 DNA sequence 
Nucleic Acid Accession*: NM_00S409.3 
1 0 Coding sequence: 94-378 (underlined sequences correspond to start, and stop codons) 

1 11 21 31 41 51 

TTCCTTTCAT GTTCAGCATT TCTACTCCTT CCAAGAAGAG CAGCAAAGCT GAAGTAGCAG 60 
1 5 CAACAGCACC AGCAGCA ACA GCAAAAAACA AACAJG AGTG TO AAGGGCAT GGCTATAGCC 1 20 
TTGGCTGTGA TATTGTGTGC TACAGTTGTT CAAGGCTTCC CCATGTTCAA AAGAGGACGC 180 
TGTCTTTGCA TAGGCCCTGG GGTAAAAGCA GTGAAAGTGG CAGATATTGA GAAAGCCTCC 240 
ATAATGTACC CAAGTAACAA CTGTGACAAA AT AG A AGTG A TTATTACCCT GAAAGAAAAT 300 
AAAGGACAAC GATGCCTA AA TCCCA AATCG AAGCAAGCA A GGCTTATAAT CAAAAAAGTT 3 60 
20 GAAAGAAAGA ATrTTTAAAA ATATCAAAAC ATATGAAGTC CTGGAAAAGG GCATCTGAAA 420 
AACCTAGAAC AAGTTTAACT GTGACTACTG AAATGACAAG AATTCTACAG TAGGAAACTG 480 
AGACTTTTCT ATGGTTTTGT GACTTTCAAC TTTTGTACAG TTATGTGAAG GATGAAAGGT 540 
GGGTGAAAGG ACCAAAAACA GAAATACAGT CTTCCTGAAT GAATGACAAT CAGAATTCCA 600 
CTGCCCAAAG GAGTCCAGCA ATTAAATGG A TTTCTAGGAA AAGCTACCTT AAG AAAGGCT 660 

2 5 GGTTACCATC GGAGTTTACA AAGTGCTTTC ACGTTCTTAC TTGTTGTATT ATACATTCAT 720 

GCATTTCTAG GCTAGAGAAC CTTCTAGATT TGATGCTTAC AACTATTCTG TTGTGACTAT 780 
GAGAACATTT CTGTGTCTAG AAGTTATCTG TCTGTATTGA TCTTTATGCT ATATTACTAT 840 
CTGTGGTTAC AGTGGAGACA TTGACATTAT TACTGGAGTC AAGCXC1TAT AAGTCAAAAG 900 
CATCTATGTG TCGTAAAGCA TTCCTCAAAC ATTTT7TCAT GCAAATACAC ACTTCTTTCC 960 

3 0 CCAAATATCA TGTAGCACAT CAATATGTAG GGAAACATTC TTATGCATCA TTTGGTTTGT 1020 

TTTATAACCA ATTCATTAAA TGTAATTCAT AAAATGTACT ATGAAAAAAA TTATACGCTA 1080 
TGGGATACTG GCAACAGTGC ACATATTTCA TAACCAAATT AGCAGCACCG GTCTTAATTT 1 140 
OATGTTTTTC AACTTTTATT CATTGAGATG TTTTGAAGCA ATTAGGATAT GTGTGTTTAC 1200 
TGTACTTTTT GTTTTGATCC GTTTGTATAA ATGATAGCAA TATCTTGGAC ACATTTGAAA 1260 
35 TACAAAATGT TTTTGTCTAC CAAAGAAAAA TGTTGAAAAA TAAGCAAATG TATACCTAOC 1320 
AATCACTTTT ACTTTTTGTA ATTCTGTCTC TTAGAAAAAT ACATAATCTA ATCAATTTCT 1380 
TTGTTCATGC CTATATACTG TAAAATTTAG GTATACTCAA GACTAGTTTA AAGAATCAAA 1440 
GTCATTTTTT TCTCTAATAA ACTACCACAA CCTTTCTTTT TTAAAAAAAA AAA 

40 Seq ID NO: 1 32 Protein sequence: 

Protein Accession #: NP.005400.1 

1 II 21 31 41 51 

45 MSVKGMAIALAVILCaWvQGFPMFKRGRCLCIGPGVKAVKVADIEKASIMYPSNNCDKI 60 
EVIITLKENK GQRCLNPKSK QARLIIKKVE RKNF 

Seq ID NO: 133 DNA sequence 
Nucleic Acid Accession #: NM_0I2342 
50 Coding sequence: 373-1 155 (underlined sequences correspond to start and stop codons) 

I U 21 31 41 51 

CTGGCGCGGG CGGGAGCTGC GGCGGATACC CTTGCGTGCT GTGGAGACCC TACTCTCTTC 60 

5 5 GCTGAGAACG GCCGCTAGCG GGGACTGAAG GCCGGGAGCC CACTCCCGAC CCGGGGCTAG 120 
CGTGCGTCCC TAGAGTCGAG CGGGGCAAGG GAGCCAGTGG CCGCCGACGG GGGACCGGGA 180 
AACTTTTCTG GGCTCCTGG A GAGCCCTGTA GCCGCGCTCC ATGCTCCGGC AGCGGCCCGA 240 
AACCCAGCCC CGCCGCTGAC GGAGCCCGCC GCTCCGGGCA GGGCCCATGC CCTGCGCGCT 300 
CCGGGGGTCG TAGCTGCCGC CGAGCCGGGG CTCCGGAAGC CGGCGGGGGC GCCGCGGCCG 360 

60 TGCGGGGCGT CAATGGATCG CCACTCCAGC TACATCTTCA TCTGGCTGCA GCTGGAGCTC 420 
TGCGCCATGG CCGTGCTGCT CACCAAAGGT GAAATTCGAT GCTACTGTGA TGCTGCCCAC 480 
TGTGTAGCCA CTGGTTATAT GTGTAAATCT GAGCTCAGf Ci ( LJ iCTTCTC'I i/ CTTCTT 540 
GATCCTCAGA ACTCAAATTC CCCACTCACC CATGGC TOCC TGGACTCTCT TGCAAGCACG 600 
ACAGACATCT GCCAAGCCAA ACAGGCCCGA AACCACTCTG GCACCACCAT ACCCACATTG 660 

65 GAATGCTGTC ATGAAGACAT GTGCAATTAC AGAGGGCTGC ACGATGTTCT CTCTCCTCCC 720 
AGGGGTGAGG GCTCAGGACA AGGAAACAGG TATCAGCATG ATGGTAGCAG AAACCTTATC 780 
ACCAAGGTGC AGGAGCTGAC TTCTTCCAAA GAGTTGTGGT TCCGGGCAGC GGTCATTGCC 840 
GTGCCCATTG CTGGAGGGCT GATTTTAGTG TTGCTTATTA TGTTGGCCCT GAGGATGCTT 900 
CGAAGTGAAA ATAAGAGGCT GCAGGATCAG CGGCAACAGA TGCTCTCCCG TTTGCACTAC 960 

7 0 AGCTTTCACG GACACCATTC CAAAAAGGG G CAGGTTGCAA AGTTAG ACTT GG AATGCATG i 020 
GTGCCGGTCA GTGGGCACGA GAACTGCTGT CTGACCTGTG ATAAAATGAG ACAAGCAGAC 1080 
CTCAGCAACG ATAAGATCCT CTGGCTTGTT CACTGGGGCA TGTACAGTGG GCACGGGAAG 1140 
CTGGAATTCG TATGACGGAG TCTTATCTGA ACTACACTTA CTGAACAGCT TGAAGGCCTT 1200 
TTG AGTTCTG CTGG ACAGGA GCACTTTATC TGAAG ACAAA CTCATTTAAT CATCTTTGAG 1 260 

75 AGACAAAATG ACCTCTGCA A ACAGAATCTT GG ATATTTCT TCTG A AGGAT TATTTG CACA 1320 
GACTTAAATA CAGTTAAATG TGTTATTTGC TTTTAAAATT ATAAAAAGCA AAGAGAAGAC 1380 
TTTGTACACA CTGTCACCAG GGTTATTTGC ATCCAAGGGA GCTGGAATTG AGTACCTAAA 1440 
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TA AACAAAA A TGTGCCCTAT GTAAGCTTCT ACATCTTGAT TTATTGTAAA GATTTAAAAG 1500 
AAATATATAT ATTTTGTCTG A 

Seq 1DNQ: 134 Protein Sequence: 
5 Protein Accession ft NP 036474.1 

1 11 21 31 41 51 

MDRHSSYIFI WLQLELCAMA VLLTKGE1RC YCDAAHCVAT GYMCKSELSA CFSRLLDPQN 60 
1 0 SNSPLTHGCL DSLASTTDIC QAKQARNHSG TTIPTLECCH EDMCNYRGLH DVLSPPRGEA 120 
SGOGNRYOJIU GSRNLITKVQ ELTSSKELWF RAAVIAVP1A GGLILVLLIM LALRMLRSEN 1 80 
KRLQDQRQQM LSRLHYSFHG HHSKKGQVAK LDLECMVPVS GHENCCLTCD KMRQADLSND 240 
K1LSLVHWGM YSGHGKLEFV 

1 5 Seq ID NO: 135 DNA sequence 

Nucleic Acid Accession ft NM.001627.1 

Coding sequence: 64-1 8 1 5 (underlined sequences correspond to start and stop codons) 



CGGGACGACG CCCCCTCCTG CGGCGTGGAC TCCGTCAGTG GCCCACCAAG A AGGAGGAGG 60 
AATATGGAAT CCAAGGGGGC CAGTTCCTGC CGTCTGCTCT TCTGCCTCTT GATCTCCGCC 120 
ACCGTCTTCA GGCCAGGCCT TGGATGGTAT ACTGTAAATT CAGCATATGG AGATACCATT 1 80 
ATCATACCTT GCCGACTTGA CGTACCTCAG AATCTCATGT TTGGCAAATG GAAATATGAA 240 

25 AAGCCCGATG GCTCCCCAGT ATTTATTGCC TTCAGATCCT CTACAAAGAA AAGTGTGCAG 300 
TACGACGATG TACCAGAATA CAAAGACAGA TTGAACCTCT CAGAAAACTA CACTTTGTCT 360 
ATCAGTAATG CAAGGATCAG TGATGAAAAG AGATTTGTGT GCATGCTAGT AACTGAGGAC 420 
A< ram iGCACCTAC I 1-CAAG I'GTTCAAGC C rf 1 * (XTGAAATi -ISO 
GTAAGCAAAG CACTGTTTCT CGAAACAGAG CAGCTAAAAA AGTTGGGTGA CTGCATTTCA 540 

3 0 GAAGACAGTT ATCCAGATGG CAATATCACA TGGTACAGGA ATGGAAAAGT GCTACATCCC 600 
CTTGAAGGAG CGGTGGTCAT AATTTTTAAA AAGGAAATGG ACCCAGTGAC TCAGCTCTAT 660 
ACCATGACTT CCACCCTGGA GTACAAGACA ACCAAGGCTG ACATACAAAT GCCATTCACC 720 
TGCTCGGTGA CATATTATGG AGCATCTGGC CAGAAAACAA TTCATTCTGA ACAGGCAGTA 780 
TTTGATATTT ACTATCCTAC AGAGCAGGTG ACAATACAAG TGCTGCCACC AAAAAATGCC 840 

35 ATCAAAGAAG GGGATAACAT CACTCTTAAA TGCTTAGGGA ATGGCAACCC TCCCCCAGAG 900 
GAATTTTTGT TTTACTTACC AGGACAGCCC GAAGGAATAA GAAGCTCAAA TACTTACACA 960 
CTGATGGATG TGAGGCGCAA TGCAACAGGA GACTACAAGT GTTCCCTGAT AGACAAAAAA 1020 
AGCATGATTG CTTCA ACAGC CATCACAGTT CACTATTTGG ATTTGTCCTT AAACCCAAGT 1 080 
GGAGAAGTGA CTAGACAGAT TGGTGATGCC CTACCCGTGT CATGCACAAT ATCTGCTAGC 1140 

40 AGGAATGCAA CTGTGGTATG GATGAAAGAT AACATCAGGC TTCGATCTAG CCCGTGATTT 1203 
TCTAGTCTTC ATTATCAGGA TGCTGGAAAC TATGTCTGCG A AACTGCTCT GCAGGAGGTT 1260 
GAAGGACTAA AGAAAAGAGA GTCATTGACT CTCATTGTAG AAGGCAAACC TCAAATAAAA 1320 
ATGACAAAGA AAACTGATCC CAGTGGACTA TCTAAAACAA TAATCTGCCA TGTGGAAGGT 1380 
TTTCCAAAGC CAGCCATTCA GTGGACAATT ACTGGCAGTG GAAGCGTCAT AAACCAAACA 1440 

45 GAGGAATCTC CTTATATTAA TGGCAGGTAT TATAGTAAAA TTATCATTTC CCCTGAAGAG 1 500 
AATGTTACAT TAACTTGCAC AGCAGAAAAC CAACTGGAGA GAACAGTAAA CTCCTTGAAT 1560 
GTCTCTGCTA TAAGTATTCC AG AACACG AT GAGGCAGACG AGATAAGTGA TG AAAACAG A 1 620 
GAAAAGGTGA ATGACCAGGC AAAACTAATT GTGGGAATCG TTGTTGGTCT CCTCCTTGCT 1680 
GCCC7TGTTG CTGGTGTCGT CTACTGGCTG TACATGAAGA AGTCAAAGAC TGCATCAAAA 1 740 

50 CATGTAAACA AGGACCTCGG TAATATGGAA OAAAACAAAA AGTTAGAAGA AAACAATCAC i 800 
AAAACTGAAG CCTAAGAGAG AAACTGTCCT AGTTGTCCAG AGATAAAAAT CATATAGACC I860 
AATTGAAGCA TGAACGTGGA TTGTATTTAA GACATAAACA AAGACATTGA CAGCAATTCA 1920 
TGGTTC AAGT ATTA AGCAGT TCATTCTACC AAGCTGTCAC AGGTTTTCAG AG AATTATCT 1 980 
CAAGTAAAAC AAATGAAATT TAATTACAAA CAATAAGAAC AAGTTTTGGC AGCCATG ATA 2040 

55 ATAGGTCATA TGTTGTGTTT GGTTCAATTT TTTTTCCGTA AATGTCTGCA CTGAGGATTT 2100 
CTTTTTGGTT TGCCTTTTAT GTAAATTTTT TACGTAGCTA TTTTTATACA CTGTAAGCTT 21 60 
TGTTCTGGGA GTTGCTGTTA ATCTGATGTA TAATGTAATG TTTTTATTTC AATTGTTTAT 2220 
ATGGATAATC TGAGCAGGTA CATTTCTGAT TCTGATTCCT ATCAGCAATG CCCCAAACTT 2280 
TCTCATAAGC ACCTAAAACC CAAAGGTGGC AGCTTGTGAA GATTGGGGAC ACTCATATTG 2340 

60 CCCTAATTAA AAACTGTGAT TTTTATCACA AGGGAGGGGA GGCCGAGAGT CAGACTGATA 2400 
GACACCATAG GAGCCGACTC TTTGATATGC CACCAGCGAA CTCTCAGAAA TAAATCACAG 2460 
ATGCATATAG ACACACATAC ATAATGGTAC TCCCAAACTG ACAATTTl AC CTATrCTGAA 2520 
AAAGACATAA AACAGAATT 

65 Seq ID NO: 136 Protein sequence: 

Protein Accession ft NPJ0I618.1 

I 11 21 31 41 51 

70 LiESKGASSCR LLFCLLISAT VFRPGLGWYT VNSAYGDTH IPCRLDVPQN LMFGKWKYEK 60 
?DGSPVHAF RSSTKKSVQY DDVPEYKDRL NLSENYTLSI SNARISDEKR FVCMLVTEDN 120 
VPEAPTIVKV FKQPSKFEIV SKALFLETEQ LKKLGDCISE DSYPDGN1TW YRNGKVLHPL 1 80 
EGAVVIIFKK EMDPVTQLYT MTSTLEYKTT KADIQMPFTC S VTYYGPSGQ KTEHSEQAVF 240 
D1YYPTEQVT IQVLPPKNAI KEGDNITLKC LGNGNPPPE5 FLFYLPGQPE GIRSSNTYTL 300 

75 MDVRRNATGD YKCSLIDKKS MIASTAITVH YLDLSLNPSG EVTRQIGDAL PVSCTISASR 360 
NATVVWMKDN IRLRSSPSFS SLHYQD AGNY VCETALQEVE GLKKRESLTL 1VEGKPQKM 420 
TKKTDPSGLS KTI1CHVEGF PKPA1QWTIT GSGSVINQTE ESPYINGRYY SK1IISPEEN 480 
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5 Seq ID NO: 137 DNA sequence 

Nucleic Acid Accession #: XM 030559 

. Coding sequence; 1-110 (underlined sequences correspond to start and stop codons) 

1 II 21 31 41 51 

10 1 | 1 | | | 

ATGAACCGCA CCCACCCit'iCA CUGGGCGGGC AOCOOCUiXTCQOCACTAT GGAGGTGAAG 60 
AGCAAGTTTG GAGCTGA ATT TCGTCGCTTT TCGCTGGAAA GATCAAAACC TGG AAAATTT 120 
G AGGAGTTTT ATGGATTACT ACA ACATGTT CATAAGATCC CCAATGTTGA CGTTTTGGTA I SO 
GGCTATGCAG ACATCCATGG AGACTTACTA CCTATAAATA ATGATGATAA TTATCACAAA 240 

15 GCTGTTTCAA CGGCCAATCC ACTGCTTAGG ATATTTATAC AAAAGAAGGA AGAAGCAGAC 303 
TACAGTGGCT TTGGTACAGA CACGCTAATA AAGAAGAAGA ATGTTTTAAC CAACGTATTG 360 
CGTCCTGACA ACCATAGAAA AAAGCCACAT ATAGTCATTA GTATGCCCCA AGACTTTAGA 420 
CCTGTGTCTT CTATTATAGA CGTGGATATT CTCCCAGAAA CGCATCGTAG GGTACGTCTT 480 
TACAAATACG GCACGGAGAA ACCCCTAGGA TTCTACATCC GGGATGGCTC CAGTGTCAGG 540 

2 0 ■ GTAAGACCAC ATGGCTTAG A AAAGGTTCCA GGGATCTTTA TATCCAGGCT TGTCCCAGGA 600 
GOTCTGGCTC A A AGTACAGG ACTATTAGCT GTTAATG ATG AAGTTTTAGA AGTTAATGGC 660 
ATAGAAGTTT CAGGGAAGAG CCTTGATCAA GTAACAGACA TGATGATTGC AAATAGCCGT 720 
AACCTCATCA TAACAGTGAG ACCGGCAAAC CAGAGGAATA ATGTTGTGAG GAACAGTCGG 780 
ACTTCTGGCA GTTCCGGTCA GTCTACTGAT AACAGCCTTC TTGGCTACCC ACAGCAGATT 840 

2 5 GAACCAAGCT TTGAGCCAGA GGATGAAG AC AGCG A AGAAG ATGACATTAT CATTGAAGAC 900 
AATGGAGTGC CACAGCAGAT TCCAAAAGCT GTTCCTAATA CTGAGAGCCT GGAGTCATTA 960 
ACACAGATAG AGCTAAGCTT TGAGTCTGOA CAGAATGGCT TTATTCCCTC TAATGAAGTG 1020 
AGCTTAGCAG O \TAGCAAfi CAG n , ^ C ^CGGAATTTG AAACACATGC TCCAGATCAA 1080 
AAACTCTTAG AAGAAGATGG AACAATCATA ACATTATGA 



1 11 2! 31 41 51 

I I i I 1 I 

MNRSHRHGAG SGCLGTMEVK SKFGAEFRRF SLERSKPGKF EEFYGLLQHV HKIPNVDVLV SO 
GYADIHGDLL PINNDDNYHK AVSTANPLLR iFIQKKEEAD YSAFGTDTU KKKNVLTNVL 1 20 
RPDNHRKKPH 1VISMPQDFR PVSSIIDVDI LPETHRRVKL YKYGTEKPLG FYIRDGSSVR i 80 
VTF'HGLEKVP GIFISRLVPG GLAQSTGLLA VNDBVLEVNG IEVSGKSLDQ VTDMMIANSR 240 
NLIITVRFAN QRNNVVRNSR TSGSSGQSTD NSLLGYPQQI EPSFEPEDED SEEDDHIED 300 
NGVFQQIPKA VPNTESLESL TQIELSFESG QNGF1PSNEV SLAAIASSSN TEFETHAPDQ 360 
KLLEEDGT1ITL 
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It is understood that the examples described above in no way serve to limit the 
true scope of this invention, but rather are presented for illustrative purposes. All 
publications, sequences of accession numbers, and patent applications cited in this 
specification are herein incorporated by reference as if each individual publication or patent 
application were specifically and individually indicated to be incorporated by reference. 
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WHAT IS CLAIMED IS: 

1 1 . A method of detecting a breast cancer-associated transcript in a cell 

2 from a patient, the method comprising contacting a biological sample from the patient with a 

3 polynucleotide that selectively hybridizes to a sequence at least 80% identical to a sequence 

4 as shown in Tables 1-25. 

1 2. The method of claim 1, wherein the biological sample comprises 

2 isolated nucleic acids. 

1 3. The method of claim 2, wherein the nucleic acids are mRNA, 

1 4. The method of claim 2, further comprising the step of amplifying 

2 nucleic acids before the step of contacting the biological sample with the polynucleotide. 

1 5. The method of claim 1, wherein the polynucleotide comprises a 

2 sequence as shown in Tables 1-25. 

1 6. The method of claim 1, wherein the polynucleotide is immobilized on 

2 a solid surface. 

1 7. The method of claim 1 , wherein the patient is undergoing a therapeutic 

2 regimen to treat breast cancer. 

1 8. The method of claim 1 , wherein the patient is suspected of having 

2 breast cancer. 

1 9. An isolated nucleic acid molecule consisting of a polynucleotide 

2 sequence as shown in Tables 1 -25. 

1 10. The nucleic acid molecule of claim 9, which is labeled. 

1 1 1 . An expression vector comprising the nucleic acid of claim 9 . 

1 12. A host cell comprising the expression vector of claim 1 1 . 
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1 13. An isolated polypeptide which is encoded by a nucleic acid molecule 

2 having polynucleotide sequence as shown in Tables 1-25. 

1 14. An antibody that specifically binds a polypeptide of claim 13. 

1 1 5 . The antibody of claim 1 4, further conjugated to an effector component. 

1 1 6. The antibody of claim 1 5 , wherein the effector component is a 

2 fluorescent label. 

1 17. The antibody of claim 15, wherein the effector component is a 

2 radioisotope or a cytotoxic chemical. 

1 18. The antibody of claim 15, which is an antibody fragment. 

1 19. The antibody of claim 15, which is a humanized antibody 

1 20. A method of detecting a breast cancer cell in a biological sample from 

2 a patient, the method comprising contacting the biological sample with an antibody of claim 

3 14. 

1 2 1 . The method of claim 20, wherein the antibody is further conj ugated to 

2 an effector component. 

1 22. The method of claim 21, wherein the effector component is a 

2 fluorescent label, 

1 23 . A method for identifying a compound that modulates a breast cancer- 

2 associated polypeptide, the method comprising the steps of: 

3 (i) contacting the compound with a breast cancer-associated polypeptide, the 

4 polypeptide encoded by a polynucleotide that selectively hybridizes to a sequence at least 

5 80% identical to a sequence as shown in Tables 1-25; and 

6 (ii) determining the functional effect of the compound upon the polypeptide, 
1 24. A drug screening assay comprising the steps of 
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2 (i) administering a test compound to a mammal having breast cancer or a cell 

3 isolated therefrom; 

4 (ii) comparing the level of gene expression of a polynucleotide that selectively 

5 hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 1 -25 in a 

6 treated cell or mammal with the level of gene expression of the polynucleotide in a control 

7 cell or mammal, wherein a test compound that modulates the level of expression of the 

8 polynucleotide is a candidate for the treatment of breast cancer. 
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